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HUMAN SULFONYLUREA RECEPTOR SURH 
TECHNICAL FIELD 

The present invention relates to nucleic acid and amino acid sequences of a novel human 
sulfonylurea receptor and to the use of these sequences in the diagnosis, study, prevention and 
5 treatment of disease. 

BACKGROUND ART 

ATP-dependent potassium (Ky^jp) channels serve to couple metabolic state to electrical 
activity in many types of ceils. By hyperpolarizing the cell. K^-pp channels limit eiecrricai activity 
and hence reduce Ca-' entry into muscle and nerve cells. In the pancreas, they are a critical link 
10 between blood glucose concentration and insulin secretion. 

Sulfonylureas (SUs) are oral hypoglycemics widely used in the treatment of non-insulin 
dependent diabetes mellitus (NIDDM). SUs stimulate insulin release from pancreatic islet b cells. 
The mechanism for insulin release involves 1 ) inhibition of a K.^yp channel which sets the b cell 
resting membrane potential. 2) reduction of K' outflow which causes b cell depolarization and 3) 
15 the activation of one or more voitagc-dependent L-type calcium channels which results in Ca-' 
influx, exocyiosis. and insulin release, SUs such as tolbutamide or glyburide decrease K^^-p 
channel activity, thereby depolarizing the cell and triggering insulin release. 

Until recently the K^yp channel and the sulfonylurea receptor (SUR) were thought to be 
the same molecule ( Aguilar-Br>*an et al (1995) Science 268:423-426); however, SUR does not 
20 possess intrinsic K' channel activity ( Ammala C et al (1996) Nature 379:545-548), Instead SUR 
interacts with inward-rectifier K' channels, conferring SU and ATP sensitivity to and modulating 
the activuy of these channels (Inagaki N et al (1995) Science 270:1 166-1 170). 

A second isoform of SUR. denoted SUR2. has recently been discovered in rat. This 
isoform has different tissue distribution and different SU and ATP binding properties from rat 
25 SUR (Inagaki N et al ( 1 996) Neuron 16:1011-1017). The channel kinetics of Kir6.2- an mvvard- 
rectifier K' channel, co-expressed with SUR2 are different than the channel kinetics of Kir6.2 co- 
expressed with SUR. Based on these observations, it is suggested that a family of structurally 
related but functionally distinct SURs determine the ATP sensitivity and pharmacological 
responses of K^yp charmeis in various tissues (Inagaki N et al (1996), supra). 
30 SURs from rat and hamster consist of 1581 and 1582 amino acids. respectivel\-. with 12 

potential membrane-spanning helices (Aguiiar-Br\'an et ah supra). In addition, the proteins 
contain two domains havmc strong similarii>' to the nucleotide hindmg folds (TslBFsi of the .ATP- 
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binding cassette (ABC) superfamily of proieins. The proposed topology of the rat. hamster, and a 
recently reported human SIJR (GenBank GI 1369844: unpublished) consists of an exiemai amino 
terminus, nine predicted transmembrane hcJices, the first cytosolic NBF (NBF-1 four more 
transmembrane helices, the second cytosolic NBF (NBF-2) and a cytosolic C-terminus. The 

5 topology of the SURs are similar to other members of the ABC superfamily including multidrug 
resistance (MDR) proteins and cystic fibrosis transmembrane regulators (CFTR; Philipson LH 
and Steiner DF (1995) Science 268:372-373). 

The NBFs of ABC superfamily proteins control activity through their interaction with 
cytosolic nucleotides. In cystic fibrosis, the more frequent and severe disease mutations are 

1 0 located in the nucleotides encoding the two NBFs of the CFTR protein (Tsui L-C (1 992) Trends 
Genet 8:392). Familial persistent hyperinsulinemic hypoglycemia of infancy (PHHI) may be 
caused by mutations affecung NBF-2 of SUR (Thomas PM et al (1995) Science 268:426-429). 

SU-sensitive K^-^p chatmcls are present in brain cells and play a role in neurosecretion at 
ner\'e terminals. Kj^j^ channels in the substantia nigra, a brain region that shows high SU 

15 binding, are inhibited by high glucose concentrations and antidiabetic SUs. and are activated by 
ATP depletion and anoxia. Furthermore, inhibition of the K^vip channel activates 
gamma-aminobutyric acid (GABA) release, whereas K^j,, channel activation inhibits GABA 
release (Amoroso S et al (1990) Science 247:852-854: Schmidt- Antomarchi et al (1990) Proc 
Natl Acad Sci USA 87: 3489-3492). 

20 Action potentials in cardiac cells are modulated by SU compounds binding to SURs. The 

durauon of the action potential of guinea pig cardiac cells was drastically reduced by decreasing 
intracellular ATP concentrations ([ATP],^) by perfusion or by blockade of oxidative 
phosphor\'lation. Glibenclamide. an SU compound, was found to restore normal or nearly 
normal action potentials in these [ATP],n-depleted cardiac cells. <Fosset M et al (1988) J Biol 

25 Chem 263:7933-7936). Restoration was attributed to inhibition of cardiac K.^^p channels by 
sulfonylurea compounds acting via the SURs. 

SURs confer ATP and SU sensitivity to inwardly-rectifying potassium channels, thereby 
coupling metabolic state to electrical activity in tissues such as brain, pancreas, and heart. SURs 
are useful in the diagnosis and treatment of diseases related to abnormal K^^p channel function. 

30 such as NIDDM and PHMI. The selective modulation of the expression or activities of SURs 
may allow the successful manatzement of such diseases. 
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DISCLOSURE OF THE INVENTION 

The present invention discloses a human sulfonylurea receptor protein, hereinafter 
referred to as SURl I. having chemical and structural homology to the SUR protein from rat and 
hamster. Accordingly, the invention features a substantially purified SURH. having the amino 
5 acid sequence of SEQ ID NO: 1 and the structural characteristics of SURs, 

One aspect of the invention features isolated and substantially purified polynucleotides 
which encode SURH. In a panicular aspect, the polynucleotide is the nucleotide sequence of SEQ 
ID N0:2. In another aspect, the polynucleotide is the nucleotide sequence extending from T^ygo 
to of SEQ IDN0:2, 
10 The invention also relates to a polynucleotide sequence comprising the complement of 

SEQ ID N0:2 or variants thereof In addition, the invention features nucleotide sequences which 
hybridize under stnngent conditions to SEQ ID N0:2. 

The present invention also relates to an expression vector which contains polynucleotides 
encoding SURH, and the use of said vector to transform or transfect host cells or organisms. The 
15 invention also features methods for producing SURH. The present invention also relates to 
antibodies which bind specifically to SURH polypeptides, and to agonists and antagonists of 
SURH. The present invention also relates to pharmaceutical compositions comprising SURH. 
fragments thereof agonists of SURH. or antagonists of SURH, in conjunction with a suitable 
pharmaceutical carrier. 
20 BRIEF DESCRIPTION OF DRAWINGS 

Figures 1 A- 1 M show the amino acid sequence (SEQ ID NO; 1 ) and the nucleic acid 
sequence (SEQ ID N0:2) of the human sulfonylurea receptor protein SURH. produced usmg 
MacDNAsis software (Hitachi Software Engineering Co Ltd), 

Figures 2A-2H show the amino acid sequence alignments among SURH (SEQ ID NO:l ). 
25 a human SUR isoform (GI 1369844; SEQ ID N0:3). SUR from Norway rat (GI 131 15343: SEQ 
ID N0:4). and SUR from black-bellied hamster (GI 784874; SEQ ID NO:5 ) produced using the 
multisequence alignment program of DNAStar software (DNASiar Inc. Madison Wl). 

MODES FOR CARRYING OUT THE INVENTION 

Definitions 

30 '^Nucleic acid sequence" as used herem refers to an oligonucleotide, nucleotide or 

polynucleotide, and fragments or portions thereof and to DNA or RNa\ of genomic or synthetic 
origm which may be single- or double-strandcJ. and represent the sense or antisense strand. 



BNSDOCID: <WO 9814571A1 J_> 



wo 98/14571 



PCTAJS97/17744 



Similarly, amino acid sequence as used herein refers to peptide or protein sequence. 

"Consensus" as used herein may refer to a nucleic acid sequence 1 ) which has been 
resequenced to resolve uncalled bases, 2) which has been extended using XL-PCR (Perkin Elmer. 
Norwalk CT) in the 5' and/or the 3' direction and resequenced, 3) which has been assembled 
5 from the overlapping sequences of more than one Incyte clone GCG Fragment Assembly System. 
(GCG. Madison Wl). or 4) which has been both extended and assembled, 

''Peptide nucleic acid" as used herein refers to a molecule which comprises an oligomer to 
which an amino acid residue, such as lysine, and an amino group have been added. These small 
molecules, also designated anti-gene agents, stop transcript elongation by binding to their 
10 complementary (template) strand of nucleic acid (Nielsen PE et ai (1993) Anticancer Drug Des 
8:53-63). 

A "variant" of SURH is defined as an amino acid sequence that is different by one or 
more amino acid substitutions. The variant may have **conservative'' changes, wherein a 
substituted amino acid has similar structural or chemical properties, eg, replacement of leucine 

1 5 with isoleucine. More rarely, a variant may have "nonconservative** changes, eg. replacement of 
a glycine with a ir\'ptophan. Similar minor variations may also include amino acid deletions or 
insertions, or both. Guidance in determining which and how many amino acid residues may be 
substituted, inserted or deleted without abolishing biological or immunological activity may be 
found using computer programs well known in the an. tor example. DNAStar software. 

20 A "deletion" is defined as a change in either nucleotide or ammo acid sequence in which 

one or more nucleotides or amino acid residues, respectively, arc absent. 

An "insertion" or "addition" is that change in a nucleotide or ammo acid sequence which 
has resulted in the addition of one or more nucleotides or amino acid residues, respectively, as 
compared to the naturally occurring SURH. 

25 A "substitution" results from the replacement of one or more nucleotides or amino acids 

by different nucleotides or amino acids, respectively. 

The term "biologically active" refers to a SURH having structural, regulatory or 
biochemical functions of the naturally occurring SURII. Likewise, "immunologically active" 
defines the capability of the natural, recombinant or synthetic SURH. or any oligopeptide thereot\ 

30 to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

The term "agonist" refers to a molecmc ulnch. u hen hound to SURH. causes a change in 

-4- 
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SURH which modulates the biological activity of SURil. The term "antagonist" refers to a 
moiecuie which, when bound to SURJ L blocks the binding of an agonist to SURH, which 
prevents the agonist-induced change in the biological activity of SURH. Agonists and 
antagonists may include proteins, nucleic acids, carbohydrates, or other molecules which bind to 
5 SURH. 

The term "modulate" as used herein refers to a change or an alteration in the biological 
activity of SURH. Modulation may be an increase or a decrease in biological activity, a change 
in binding characteristics, or any other change in the biological propenies of SURH. 

The term "derivative" as used herein refers to the chemical modification of a nucleic acid 
10 encoding SURH or the encoded SURH. Illustrative of such modifications would be replacement 
of hydrogen by an alkyl, acyl. or amino group. A nucleic acid derivative would encode a 
polypeptide which retains essential biological characteristics of natural SURH. 

As used herein, the term ''substantially purified" refers to molecules, either nucleic or 
amino acid sequences, that are removed from their natural environment, isolated or separated. 
15 and are at least 60% free, preferably 75% free, and most preferably 90% free from other 
components with which they are naturally associated. 

''Stringency" typically occurs in a range from about Tm-5°C (5''C below the Tm of the 
probe) to about 20''C to 25X below Tm. As will be understood by those of skill in the art. a 
stnngency hybridization can be used to identify or detect identical polynucleotide sequences or to 
20 identify or detect similar or related polynucleotide sequences. 

The term "hybridization" as used herein shall include "any process by which a strand of 
nucleic acid joins with a complementarv' strand through base pairing" (Coombs J (1994) 
PjctiQuary of B ig^echnplp gy. Stockton Press, New York NY). Amplification as carried out in the 
polymerase chain reaction technologies is described in Dieffenbach C\V and GS Dveksler (1995, 
25 PCR Primer , a LaboratoPr^ Manual . Cold Spring Harbor Press. Plainview NY). 
Preferred Embodiments 

The invention relates to a human sulfonylurea receptor protein, SURH. initially identified 
among the cDNAs from a library constructed from human brain tissue (BRAINOT03) and to the 
use of the nucleic acid and amino acid sequences disclosed herein in the study, diagnosis. 
30 prevention and treatment of disease. Northern analysis using the LIFESEQ"^^^ database (Incyte 
Pharmaceuticals, Palo Alto CA) indicates that SURH-encoding nucleotide sequences are most 
abundantly transcribed in brain, and are also found in pancreas, breast, uterus and prostate. It 

-5- 

BNSDOCID: <WO 9614571A1 | > 



wo 98/14571 



PCT/US97/17744 



must be noied that naiurally occurring expression of SURPl is not necessarih iimited to these 
tissues. 

The invention also encompasses SURH variants. A preferred SURH variant is one 
having at least 80% amino acid sequence similarity to the SURH ammo acid sequence (SEQ ID 
5 NO: 1 ), a more preferred SURH variant is one having at least 90% amino acid sequence similarity 
to SEQ ID NO: 1 and a most preferred SURH variant is one having at least 95% amino acid 
sequence similarity to SEQ ID NO: 1 . 

The nucleic acid sequence encoding SURH was first identified in the cDNA. Incyte Clone 
662342. through a computer-generated search for amino acid sequence alignments. The 

10 consensus nucleotide sequence. SEQ ID NO:2, disclosed herein (Figures 1 A-IG) encodes the 
amino acid sequence. SEQ ID NO: I. designated SURH. The consensus nucleotide sequence was 
extended and assembled from Incyte Clones 1270543 (BRAINOT09); 1332410 (PANCNOT07): 
640147 (BRSTNOT03): 641239 (BRSTNOT03); 662342 (BRAiNOT03): and 952281 
(PANCNOT05) from the LIFESEQ™ database (Incyte Pharmaceuticals. Palo Alto CA). 

15 The present invention is based in part on the chemical homology shown in Figures 2A- 

2H. among SURJH and an SUR isoform from human (GI 1369844, unpublished), and SUR 
homologs from rat (GI 131 15343; Aguilar-Bryan et ah supra) and hamster (GI 784874: Aguilar- 
Br>'an et ai. supra). The human isoform. the rat homolog and the hamster homolog have, 
respectively, 96%, 92% and 90% amino acid sequence identity to SURH. 

20 The SURH protein sequence consists of 1 580 amino acids. From the amino acid 

sequence alignments (Figures 2A-2H) and its hydrophobicity, SURH contains twelve potential 
membrane-spanning helices located at or near residues 31oK 75-94. 135-155, 169-189, 306-323, 
350-368,448-469, 540-560.576-596, 1002-1020. 1063-1076, 1154-1172. and 1276-1296. 
Furthermore, the protein contains two nucleotide binding fold (NBF) domains encompassing 

25 residues 695-893 (NBF-1 ) and 1356-1534 (NBF-2). The predicted topology of SURH consists of 
an extracellular ammo terminus, nine transmembrane helices, cytosolic NBF-1. four 
transmembrane helices and cytosolic NBF-2 culminating at the cytosolic C-termmus. In 
addition, the SURH protein sequence contams 1 1 potential N-glycosylation sues. Four of these 
potential N-glycosylation sites, Njo^ N^o^- ^^^04H and N,o58, reside on predicted extracellular- facing 

30 surface loops. As shown is Figures 2A-2H, the SURH amino acid sequence at positions 913 to 
923, 925 to 943 and 952 to 960, in the region between NBF-1 and the tenth predicted 
transmembrane helix, have no identity with the corresponding amino acid residues in a human 

-6- 
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SUR isoform (GI 1369844). 
The SURH Coding Sequences 

The assembled nucleic acid and deduced amino acid sequences of SURH are sho\Nai in 
Figures 1 A-IM. In accordance with the invention, any nucleic acid sequence which encodes the 
5 amino acid sequence of SURH can be used to generate recombinant molecules which express 
SURH. In a specific embodiment described herein, a partial sequence encoding SURH was first 
isolated as Incyte Clone 662342 from a human brain tissue cDNA library (BRAINOT03). 

As noted above, there is minimal amino acid sequence identity between a human SUR 
isoform (GI 1369844) and SURH between amino acids S^^j to R^^o of SEQ ID NO:l; which 

10 corresponds to nucleotides T2780 A:923 of SEQ ID NO:2. Oligonucleotides complemcmary to 
this region of SEQ ID N0:2 are highly specific for the SURH of the present invention. Such 
oligonucleotides are useful for diagnostic and therapeutic applications specific to SURH. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of SURH-encoding nucleotide sequences, some bearing minimal 

15 homology to the nucleotide sequences of any icnown and naturally occurring gene, may be 

produced. The mvention contemplates each and every possible variation of nucleotide sequence 
that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
nucleotide sequence of naturally occurring SURH, and all such variations are to be considered as 

20 being specifically disclosed. 

Although nucleotide sequences which encode SURH and its variants are preferably 
capable of hybridizing to the nucleotide sequence of the naturally occurring SURH under 
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 
sequences encoding SURjhl or its derivatives possessing a substantially different codon usage. 

25 Codons may be selected to increase the rate at which expression of the peptide occurs in a 

panicular prokaryotic or eukaryotic expression host in accordance with the frequency with which 
panicuiar codons are utilized by the host. Other reasons for substantially altenng the nucleotide 
sequence encoding SURH and its derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater 

30 half-life, than transcripts produced from the naturally occurring sequence. 

It is now possible to produce a DNA sequence, or portions thereof, encoding a SURH and 
its derivatives entirely by synthetic chcmistr>'. after which the synthetic gene may be inserted into 

-7- 
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any of the many available DNA vectors and cell systems using reagents that are well known in 
the an at the time of the filing of this application. Moreover, synthetic chemistry may be used to 
introduce mutations into a gene encoding SURH. 

Also included within the scope of the present invention are polynucleotide sequences that 
5 are capable of hybridizing to the nucleotide sequences of Figures 1 A-IM under various 

conditions of stringency. Hybridization conditions are based on the melting temperature (Tm) of 
the nucleic acid binding complex or probe, as taught in Berger and Kimmel (1987, Guide Iq 
Molecular Cloning Techniques . Methods in EnzA^mologv . Vol 1 52, Academic Press. San Diego 
CA) incorporated herein by reference, and may be used at a defined stringency. . 

10 Altered nucleic acid sequences encoding SURH which may be used in accordance with 

the invention include deletions, insertions or substitutions of different nucleotides resulting in a 
polynucleotide that encodes the same or a functionally equivalent SURH. The protein may also 
show deletions, insertions or substitutions of amino acid residues which produce a silent change 
and result in a functionally equivalent SURH. Deliberate amino acid substitutions may be made 

1 5 on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity. 2md/or the 
amphipathic nature of the residues as long as the biological activity of SURH is retained. For 
example, negatively charged amino acids include aspartic acid and glutamic acid: positively 
charged amino acids include lysine and arginine; and amino acids with uncharged polar head 
groups having similar hydrophilicity values include leucine, isolcucmc. valine: glycine, alanine: 

20 asparagine. glutamine; serine, threonine phenylalanine, and tyrosine. 

Included within the scope of the present invention are alleles of SURH. As used herein. 
an "allele" or "allelic sequence" is an alternative form of SURH. Alleles result from a mutation, 
ie. a change in the nucleic acid sequence, and generally produce altered mRNAs or polypeptides 
whose structure or function may or may not be altered. Any given gene may have none, one or 

25 many allelic forms. Common mutational changes which give rise to alleles are generally 

ascribed to natural deletions, additions or substitutions of amino acids. Each of these types of 
changes may occur alone, or in combination with the others, one or more times in a given 
sequence. 

Methods for DNA sequencing are well known in the art and employ such enzymes as the 
30 Klenow fragment of DNA polymerase 1. Sequenase KJ (US Biochemical Corp, Cleveland OH)), 
Taq polymerase (Perkin Elmer, Norwaik CT). thermostable T7 polymerase f Amersham, Chicago 
ID. or combinations of recombinant polymerases and proofreading cxonucleases such as the 
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ELONGASE Ampiification System marketed by Gibco BRL (Gaiihersburg MD). Preferably, the 
process is automated with machines such as the Hamihon Micro Lab 2200 (Hamilton. Reno NV). 
Peltier Thermal Cycier (PTC200: MJ Research, WatenovvTi MA) and the ABI 377 DNA 
sequencers (Perkin Elmer). 
5 Extending the Polynucleotide Sequence 

The polynucleotide sequence encoding SURH may be extended utilizing partial 
nucleotide sequence and various methods known in the art to detect upstream sequences such as 
promoters and regulatory elements. Gobinda et al (1993; PGR Methods Applic 2:318-22) 
disclose "restriction-site" polymerase chain reaction (PGR) as a direct method which uses 

10 universal primers to retrieve unknown sequence adjacent to a known locus. First, genomic DNA 
is amplified in the presence of primer to a linker sequence and a primer specific to the known 
region. The amplified sequences are subjected to a second round of PGR with the same linker 
primer and another specific primer internal to the first one. Products of each round of PGR are 
transcribed with an appropriate RN A polymerase and sequenced using reverse transcriptase. 

15 Inverse PGR can be used to amplify or extend sequences using divergent primers based 

on a knowTi region (Trigiia T et al (1988) Nucleic Acids Res 16:8186). The primers may be 
designed using OLIGO® 4.06 Primer Analysis Software (1992: National Biosciences Inc, 
Plymouth MN), or another appropriate program, to be 22-30 nucleotides in length, to have a GG 
content of 50% or more, and to anneal to the target sequence at temperatures about 68°-72'^ C, 

20 The method uses several restriction enzymes to generate a suitable fragmcm in the kno\\Ti region 
of a gene. The fragment is then circularized by intramoiecuiar ligation and used as a PCR 
template. 

Gapture PCR (Lagerstrom M et al ( 1 99 1 ) PCR Methods Applic 1 : 1 1 1 - 1 9) is a method for 
PGR amplification of DNA fragments adjacent to a known sequence in human and yeast artificial 
25 chromosome DNA. Capture PGR also requires multiple restriction enzyme digestions and 

ligations to place an engineered double-stranded sequence mto an unknown ponion of the DNA 
molecule before PCR. 

Another method which may be used to retrieve unknown sequences is that of Parker JD et 
al (1991: Nucleic Acids Res 19:3055-60). Additionally, one can use PCR, nested primers and 
30 PromoierFinder libraries to walk in genomic DNA (PromoterFinder^^^ Clontech (Palo Alto CA). 
This process avoids the need to screen libraries and is useful in finding intron/'exon junctions. 

Preferred libraries for screening for full length cDNAs are ones that have been 
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size-seiectcd to include larger cDNAs. Also, random primed iibranes are preferred in that they 
will contain more sequences which contain the 5' and upstream regions of genes. A randomly 
pnmed library may be particularly useful if an oUgo d(T) library does not yield a full-length 
cDNA- Genomic libraries are useful for extension into the 5' nontranslated regulatory region. 
5 Capillary electrophoresis may be used to analyze the size or confirm the nucleotide 

sequence of sequencing or PCR products. Systems for rapid sequencing are available from 
Perkin Elmer. Beckman Instruments (Fullenon CA). and other companies. Capillary sequencing 
may employ flowabie polymers for electrophoretic separation, four different fluorescent dyes 
(one for each nucleotide) which are laser activated, and detection of the emined wavelengths by a 

10 charge coupled devise camera. Output/light intensity is converted to electrical signal using 
appropriate software (eg. Genotyper'^'^ and Sequence Navigator^"^ from Perkin Elmer) and the 
entire process from loading of samples to computer analysis and electronic data display is 
computer controlled. Capillar)' electrophoresis is particularly suited to the sequencing of small 
pieces of DNA which might be present in limited amounts in a particular sample. The 

1 5 reproducible sequencing of up to 350 bp of Ml 3 phage DNA in 30 min has been reported 
(Ruiz-Martinez MC et al (1993) Anal Chem 65:2851-8). 
Expression of the Nucleotide Sequence 

In accordance with the present invention, polynucleotide sequences which encode SURH, 
fragments of the polypeptide, fusion proteins or functional equivalents thereof may he used in 

20 recombinant DNA molecules that direct the expression of SURH in appropriate host cells. Due 
to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially 
the same or a functionally equivalent amino acid sequence, may be used to clone and express 
SURH. As will be understood by those of skill in the art. it may be advantageous to produce 
SURH-encoding nucleotide sequences possessing non-naturally occurring codons. Codons 

25 preferred by a particular prokaryotic or eukar\'otic host (Murray E et al (1989) Nuc Acids Res 
17:477-508) can be selected, for example, to increase the rate of SURH expression or to produce 
recombinant RNA transcripts having desirable properties, such as a longer half-life, than 
transcripts produced from naturally occurring sequence. 

The nucleotide sequences of the present invention can be engineered in order to alter a 

30 coding sequence of SURH for a variety of reasons, including but not limited to, alterations which 
modify the cloning, processing and/or expression of the gene product. For example, mutations 
may be introduced using techniques which arc well known in the an. eg. site-directed 
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mutagenesis to insen new restriction sites, to alter giycosylation panems. to change codon 
preference, to produce splice variants, etc. 

In another embodiment of the invention, a natural, modified or recombinant nucieotide 
sequence encoding SURH may be ligated to a heterologous sequence to encode a fusion protein. 
5 For example, for screening of peptide libraries for inhibitors of SURH activity, it may be useful 
to encode a chimeric SURH protein that is recognized by a commercially available antibody. A 
fusion protein may also be engineered to contain a cleavage site located between a SURH 
sequence and the heterologous protein sequence, so that the SURH may be cleaved and 
substantially purified away from the heterologous moiety. 

10 In an alternate embodiment of the invention, the coding sequence for SURH may be 

synthesized, whole or in part, using chemical methods well knowTi in the art (see Caruthers MH 
et al fl980) Nuc Acids Res Symp Ser 215-23. Horn T et aid 980) Hue Acids Res Symp Ser 
225-32. etc.). Alternatively, the protein itself could be produced using chemical methods to 
synthesize a SURH amino acid sequence, whole or in part. For example, peptide synthesis can 

1 5 be performed using various solid-phase techniques fRoberge JY et al (1995) Science 

269:202-204) and automated synthesis may be achieved, for example, using the ABI 43 1 A 
Peptide Synthesizer (Perkin Elmer) in accordance with the instructions provided by the 
manufacturer. 

The newly synthesized peptide can be substantially purified by preparative high 
20 performance liquid chromatography (eg. Creighton (1983) Proteins . Structures and Molecular 
Principles . WH Freeman and Co. New York NYV The composition of the synthetic peptides 
may be confirmed by amino acid analysis or sequencing (eg. the Edman degradation procedure; 
Creighton. supra). Additionally the amino acid .sequence of SURH. or any part thereof, may be 
altered during direct synthesis and/or combined using chemical methods with sequences from 
25 other protems. or any part thereof, to produce a variant polypeptide. 
Expression Systems 

In order to e.xpress a biologically active SURH. the nucieotide sequence encoding SURH 
or its functional equivalent is inserted into an appropriate expression vector, ie, a vector which 
contains the necessar\' elements for the transcription and translation of the inserted coding 
30 sequence. 

Methods which are well known to those skilled in the art can be used to construct 
expression vectors containing a SURH coding sequence and appropriate transcriptional or 
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iranslational controls. These methods include in vitro recombinant DN A techniques, synthetic 
techniques and in vivo recombination or genetic recombination. Such techniques are described in 
Sambrook et al ( 1 989) Molecular Cloning . A Laboratory Manual Cold Spring Harbor Press. 
Plainvievv NY and Ausubel FM et ai ( 1 989) Current Protocols in Molecular Biolog y. John Wiley 
5 & Sons. New York NY. 

A variety of expression vector/host systems may be utilized to contain and express a 
SURH coding sequence. These include but are not limited to microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with virus expression 

10 vectors (eg. baculovirus); plant cell systems transfected with virus expression vectors (eg, 
cauliflower mosaic virus. CaMV; tobacco mosaic virus. TMV) or transformed with bacterial 
expression vectors (eg. Ti or pBRj22 plasmid): or animal cell systems. 

The "control elements" or "regulator}' sequences" of these systems vary in their strength 
and specificities and arc those noniransiated regions of the vector, enhancers, promoters, and 3' 

1 5 untranslated regions, which interact with host cellular proteins to carrv' out transcription and 
translation. Depending on the vector system and host utilized, any number of suitable 
transcription and translation elements, including constitutive and inducible promoters, may be 
used. For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
iacZ promoter of the Bluescnpti?; phagcmid (Stratagene. LaJolIa CA) or pSponl CGibco BRJL) 

20 and ptrp-lac hybrids and the like may be used. The baculovirus polyhedrin promoter may be 
used in insect cells. Promoters or enhancers derived from the genomes of plant cells (eg, heat 
shock. RUBISCO: and storage protein genes * or from plant viruses (eg. viral promoters or leader 
sequences) may be cloned into the vector. In mammalian cell systems, promoters from the 
mammalian genes or from mammalian viruses are most appropriate. If it is necessarv' to generate 

25 a cell line that contams muhiple copies of SURH. vectors based on SV40 or EBV may be used 
with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be selected depending upon the 
use intended for SURH. For example, when large quantities of SURH are needed for the 
induction of antibodies, vectors which direct high level expression of fusion proteins that are 

30 readily purified ma> be desirable. Such vectors include, but arc not limited to, the 

multifunctional E. coli cloning and expression vectors such as BluescriptiE) fStratagene), in which 
the SURH coding sequence may be Hgatcd into the vector in frame with sequences for the 
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amino-terminal Met and the subsequent 7 residues of B-gaiactosidase so that a hybrid protein is 
produced: pIN vectors (Van Heeke & Schuster ( 1989) J Bioi Chem 264:5503-5509); and the like. 
pGEX vectors (Promega. Madison WI) may also be used to express foreign polypeptides as 
fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins are soluble 
5 and can easily be purified from iysed cells by adsorption to glutathione-agarose beads followed 
by elution in the presence of free glutathione. Proteins made in such systems are designed to 
include heparin, thrombin or factor XA protease cleavage sites so that the cloned polypeptide of 
interest can be released from the GST moiety at will. 

In the yeast. Saccharomvces cerevi^iae . a number of vectors containing constitutive or 

10 inducible promoters such as alpha factor, alcohol oxidase and PGH may be used. For reviews, 
see Ausubel et al (supra) and Grant et al (1987) Methods in Enzymology 153:516-544. 

In cases where plant expression vectors are used, the expression of a sequence encoding 
SURH may be driven by any of a number of promoters. For example, viral promoters such as the 
35S and \9S promoters of CaMV (Brisson e: al (1984) Nature 310:51 1-514) may be used alone 

15 or in combination with the omega leader sequence from TMV (Takamatsu et al (1987) EMBO J 
6:307-31 1 ). Alternatively, plant promoters such as the small subunit of RUBISCO (Coruzzi et al 
(1984) EMBO J 3:1671-1680: Broglie et al (1984) Science 224:838-843): or heat shock 
promoters (Winter J and Sinibaldi RM (1991) Results Probl Cell Differ 17:85-105) may be used. 
These constructs can be introduced into plant cells by direct DNA transformation or 

20 pathogen-mediated transfection. For reviews of such techniques, see Hobbs S or Murry LE in 
McGraw Hill Yearbook of Science and Technolog y (1992) McGraw Hill New York NY, pp 
1 9 1 - 1 96 or Weissbach and Weissbach ( 1 988) Methods for Plant Molecular Biolog y. Academic 
Press. New York NY. pp 421-463. 

.An alternative expression system which could be used to express SURH is an insect 

25 system. In one such system, Autographa califomica nuclear polyhedrosis virus ( AcNPV) is used 
as a vector to express foreign genes in Spodoptera frugipcrda cells or in Trichoplusia larvae. The 
SURH coding sequence may be cloned into a nonessential region of the virus, such as the 
polyhedrin gene, and placed under control of the polyhedrm promoter. Successful insertion of 
the SURH coding sequence will render the polyhedrin gene inactive and produce recombinant 

30 virus lacking coat protein coal. The recombmani viruses are then used to infect S. frugiperda 
cells or Trichoplusia lar\'ae m which SURH i> expressed (Smith et al (1983) J Virol 46:584: 
Engelhard EK et al (1994) Proc Nat Acad S^i 'M .3:24-7). 
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In mammalian hosi ceils, a number of viral-based expression systems may be utilized. In 
cases where an adenovirus is used as an expression vector, a coding sequence for SURH may be 
ligated into an adenovirus transcription/translation complex consisting of the late promoter and 
tripartite leader sequence. Insertion in a nonessential EI or E3 region of the viral genome will 

5 result in a viable virus capable of expressing SURH in infected host cells (Logan and Shenk 
(1984) Proc Natl Acad Sci 81 :3655-59). In addition, transcription enhancers, such as the rous 
sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 

Specific initiation signals may also be required for efficient translation of a SURJ-I 
sequence. These signals include the ATG initiation codon and adjacent sequences. In cases 

10 where nucleic acid encoding SURPl, its initiation codon and upstream sequences are inserted into 
the appropriate expression vector, no additional transiational control signals may be needed. 
However, in cases where only coding sequence, or a portion thereof, is inserted, exogenous 
transiational control signals including the ATG initiation codon must be provided. Furthermore, 
the initiation codon must be in the correct reading frame to ensure translation of the entire insert. 

15 Exogenous transiational elements and initiation codons can be of various origins, both natural 
and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate to the cell system in use (Scharf D el al (1994) Results Probl Cell Differ 20:125-62: 
Binneretal (1987) Methods in Enzymol 153:516-544). 

In addition, a host cell strain may be chosen for its ability to modulate the expression of 

20 the inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but arc not limited to, acctyiation. carboxyiation. 
glycosyiation. phosphorylation, iipidation and acylation. Post-translational processing which 
cleaves a "prepro" form of the protein may also be important for correct insertion, folding and/or 
function. Different host cells such as CHO, HcLa, MOCK, 293. WT38, etc have specific cellular 

25 machiner>' and characteristic mechanisms for such post-translational activities and may be chosen 
to ensure the correct modification and processing of the introduced, foreign protein. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. For example, cell lines which stably express SURH may be transformed using 
expression vectors which contam viral origins of replication or endogenous expression elements 

30 and a selectable marker gene. Following the introduction of the vector, cells may be allowed to 
grow for 1-2 days in an enriched media before ihcy are switched to selective media. The purpose 
of the selectable marker is to confer resistance ii» selection, and its presence allows growih and 
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recQver\' of ceils which successfiiUy express the introduced sequences. Resistant clumps of 
stably transformed cells can be proliferated using tissue culture techniques appropriate to the cell 
type. 

Any number of selection systems may be used to recover transformed cell lines. These 
5 include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler M et al (1977) 
Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy I et al (1980) Cell 22:817-23) 
genes which can be employed in tk- or aprt- cells, respectively. Also, antimetabolite, antibiotic 
or herbicide resistance can be used as the basis for selection: for example, dhfr which confers 
resistance to methotrexate (Wigler M et al (1980) Proc Natl Acad Sci 77:3567-70); apt. which 

10 confers resistance to the aminoglycosides neomycin and G-418 (Colbere-Garapin F et al (1981) J 
Mol Biol 150:1-14) and als or pat. which confer resistance to chlorsulfuron and phosphinotncin 
ace t>'i transferase, respectively (Murr>'. supra). Additional selectable genes have been described, 
for example, trpB. which allows ceils to utilize indole in place of tryptophan, or hisD. which 
allows cells to utilize histinoi in place of histidine (Hanman SC and RC Mulligan (1988) Proc 

15 Nati Acad Sci 85:8047-51). Recently, the use of visible markers has gained popularity with such 
markers as anthocyanins, fi glucuronidase and its substrate. GUS, and luciferase and its substrate, 
luciferin. being widely used not only to identify trans formants. but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system (Rhodes CA et al 
(1995) Methods Mol Biol 55:121-131). 

20 Identification of Transformants Containing the Foiynucieotidc Sequence 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, its presence and expression should be confirmed. For example, if the 
SURH polynucleotide sequence is inserted within a marker gene sequence, recombinant cells 
containing SURH can be identified by the absence of marker gene function. Alternatively, a 

25 marker gene can be placed in tandem with a SURH sequence under the control of a single 

promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem SURH as well. 

Alternatively, host cells which contain the coding sequence for SURIl and express SURH 
may be identified by a variety of procedures known to those of skill in the art. These procedures 

30 include, but are not limited to. DNA-DNA or DN A-RNA hybridization and protein bioassay or 
immunoassay techniques which include membrane, solution, or chip based technologies for the 
detection and/'or quantification of the nucleic acid or protein. 
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The presence of the polynucleotide sequence encoding SURJ-i can be detected by 
DNA'DNA or DNA-RNA hybridization or amplification using probes, portions or fragments of 
SURH-encoding nucleotides. Nucleic acid amplificaiion based assays involve the use of 
oligonucleoiides or oligomers based on the SURH sequence to detect transformants containing 
5 SURH DNA or RNA. As used herein '"oligonucleotides" or "oligomers" refer to a nucleic acid 
sequence of at least about 10 nucleotides and as many as about 60 nucleotides, preferably about 
1 5 to 30 nucleotides, and more preferably about 20-25 nucleotides which can be used as a probe 
or amplimer. 

A variety of protocols for detecting and measuring the expression of SURH, using either 

10 polyclonal or monoclonal antibodies specific for the protein are known in ihe art. Examples 
include enz>'me-linked immunosorbent assay (CLISA). radioimmunoassay (RJA ) and fluorescent 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on SURH is preferred, but a competitive 
binding assay may be employed. These and other assays are described, among other places, in 

1 5 Hampton R et al ( 1 990, Serological Methods , a Laboratory Manual . APS Press, St Paul MN) and 
Maddox DE et al ( 1 983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by those skilled in the an 
and can be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to SURH include oligolabeling. nick 

20 translation, end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the 

SURH sequence, or any ponion of it, may be cloned into a vector for the production of an mRNA 
probe. Such vectors are known in the art. arc commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7. T3 or 
SP6 and labeled nucleotides. 

25 A number of companies such as Pharmacia Biotech (Piscataway NJ), Promega (Madison 

WI). and US Biochemical Corp (Cleveland OH) supply commercial kits and protocols for these 
procedures. Suitable reponer molecules or labels include those radionuclides, enzymes, 
fluorescent, chemiluminescent. or chromogenic agents as well as substrates, cofactors. inhibitors, 
magnetic particles and the like. Patents teaching the use of such labels include US Patents 

30 3.817.837. 3,850.752; 3.939.350; 3,996,345; 4.277.437; 4,275.149 and 4.366.241. Also, 
recombmant immunoglobulins may be produced as shown in US Patent No. 4.816,567 
incorporated herein by reference. 
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Purification of SURH 

Host cells transformed with a SURH-encodin^ nucleotide sequence may be cuiiured 
under conditions suitable for the expression and recovery of the encoded protein from cell 
culture. The protein produced by a recombinant cell may be contained intracellular!)' or secreted 
5 depending on the sequence and/or the vector used. As will be understood by those of skill in the 
art. expression vectors containing SURH can be designed for efficient production and proper 
transmembrane insertion of SURH into a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may join SURH to nucleotide sequence encoding a polypeptide 
domain which will facilitate purification of soluble proteins (Kroll DJ et al (1993) DNA Cell Biol 

10 12:441-53: cf discussion of vectors infra containing fusion proteins). 

SURH ma\ also be expressed as a recombinant protein with one or more additional 
polypeptide domams added to facilitate protein purification. Such purification facilitatmg 
domains include, but are not limited to. metal chelating peptides such as histidine-tr^-ptophan 
modules that allow purification on immobilized metals, protein A domains that allow purification 

15 on immobilized immunoglobulin, and the domain utilized in the FLAGS exiension/'affinity 

purification system (Immunex Corp. Seattle WA). The inclusion of a cleavable linker sequences 
such as Factor XA or enterokinase (Invitrogen* San Diego CA) between the purification domain 
and SURH is useful to facilitate purification. One such expression vector provides for expression 
of a fusion protein compromising an SURH and contains nucleic acid encoding 6 histidine 

20 residues followed by thioredoxin and an enterokinase cleavage site. The histidine residues 

facilitate purification on [MI AC ummobilized metai ion affinity chromatography as described in 
Porath et al (1992) Protein Expression and Purification 3:263-281) while the enterokinase 
cleavage site provides a means for purifying the protein from the fusion protein. 

in addition to recombinant production, fragments of SURH may be produced by direct 

25 peptide synthesis using solid-phase techniques { cf Siewan et al( 1 969) Solid>Phase Peptide 

Synthesis . WH Freeman Co. San Francisco; Merrifield J (1963) J Am Chem Soc 85:2149-2154). 
in vitro protein synthesis may be performed using manual techniques or by automation. 
Automated synthesis may be achieved, for example, using Applied Biosystcms 431 A Peptide 
Synthesizer (Perkin Elmer. Foster City CA) in accordance with the instructions provided by the 

30 manufacturer. Various fragments of SURH may be chemically synthesized separately and 
combined using chemical methods to produce the full length molecule. 
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Uses ofSURH 

The rationale for the use of polypeptide and polynucleotide sequences disclosed herein is 
based in part on the chemical and structural homology among SURH and a human isoform. a rat 
homolog and a hamster homolog of SUR. 
5 The SUR/K^TP channel complex plays a role in neurosecretion and is imphcated in the 

response of the brain to hyper- and hypoglycemia and ischemia. In the pancreas, it is a critical 
lixik between blood glucose concentration and insulin secretion. In cardiac cells, action potentials 
are modulated by sulfonylurea (SU) compounds binding to SURs. Accordingly, SURH may be 
used in the diagnosis and treatment of diseases and conditions such as. but not limited to. type II 
10 diabetes (NIDDM), hyper- and hypoglycemia (including PHHH, cardiac impulse disorders (such 
as arrhythmias and tachycardias), and other disorders relating to SUR and the SUR/K,vrr channel 
complex. 

Some SU therapeutics, while highly effective in the treatment of NIDDM and related 
disorders, have adverse side-effects. SUs can cause severe and prolonged hypoglycemia. 

1 5 requiring massive glucose infusions and hospitalization for several days. In addition, some SUs 
can cause skin lesions, including drug-induced erythroderma (exfoliative dermatitis). The isolated 
SURH protein or its fragments may therefore be useful as a target in drug discovery programs to 
screen for novel therapeutic molecules with, for example, more desirable binding characteristics, 
more efficacious metabolic lifetimes, or fewer or less debilitatmg side-effects than conventional 

20 SU therapeutics. 

SURH or its fragments may be used to identify other specific molecules with which it 
binds such as agonists or antagonists. 

SURH-specific antibodies are useful for the diagnosis and treatment of conditions and 
diseases associated with expression of the polypeptides. Antibodies specifically recognizing 
25 SURH may be used to quantitate SURH for diagnostic purposes. Therapeutic antibodies may be 
used to block or modif\' the mteractions between SUs and SURH, or SURH and K^r? channel, in 
order to treat diseases or conditions associated with SURH and/or the K^j^, channel- 
In some instances it may be advantageous to suppress SURH expression. Expression of 
mutant SURH sequences may be suppressed by administration of SURH antisense 
30 oligonucleotides. 

The SURH nucleic acid sequence of SEQ ID NO:2 can be incorporated mto effective 
eukarv'otic expression vectors and directly administered into somatic cells for gene therapy. In 
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like manner. RNA transcripts produced in vitro may be encapsulated in and administered via 
liposomes. Such vectors and transcripts may function transiently or may be incorporated into the 
host chromosomal DNA for longer term expression. 

In vivo delivery of genetic constructs into subjects is developed to the point of targeting 
5 specific ceii types. The delivery' to specific ceils has been accomplished, for instance, by 
compiexing nucleic acids with proieinous ligands that recognize cell specific receptors which 
mediate uptake (cf Wu GY et al (1991) J Biol Chem 266:14338-42). Alternatively, recombinant 
nucleic acid constructs may be injected directly for local uptake and integration (Jiao S et al 
(1992) Human Gene Therapy 3:21-33). 
10 SURH Antibodies 

SURH-specific antibodies are useful for the diagnosis of conditions and diseases 
associated with expression of SURH. Such antibodies may include, but are not limited to, 
polyclonal, monoclonal, chimeric, single chain. Fab fragments and fragments produced by a Fab 
expression librar}'. Neutralizing antibodies, ie, those which inhibit dimer formation, are 
15 especially preferred for diagnostics and therapeutics. 

SURH for antibody induction does not require biological activity; however, the protein 
fragment, or oligopeptide must be antigenic. Peptides used to induce specific antibodies may 
have an amino acid sequence consisting of at least five amino acids, preferably at least 10 amino 
acids. Preferably, they should mimic a portion of the amino acid sequence of the natural protein 
20 and may contain the entire amino acid sequence of a small, naturally occurring molecule. Short 
stretches of SURH amino acids may be fused with those of another protein such as keyhole 
limpet hemocyanin and antibody produced against the chimeric molecule. Procedures well 
known in the art can be used for the production of antibodies to SURH. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, etc 
25 may be immunized by injection with SURH or any ponion. fragment or oligopeptide which 
retains immunogenic properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. Such adjuvants include but are not limited to. Freund's. 
mineral gels such as aluminum hydroxide, and surface active substances such as iysoiecithin. 
piuronic poiyols. poiyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and 
30 dinitrophenol. BCG (bacilli Calmette-Guerin) and Corv^nebacterium par\aim are potentially 
useful human adjuvants. 

Monoclonal antibodies lo SURH may be prepared using any technique which provides for 
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the production of antibody molecules by continuous cell lines in culture. These include but are 
not limited to the hybridoma technique originally described by Koehier and Milsiein (1975 
Nature 256:495-497), the human B-ceil hybridoma technique (Kosbor et al (1983) Innmunol 
Today 4:72: Cote et al (1983) Proc Natl Acad Sci 80:2026-2030) and the EBV-hybridoma 
5 technique (Cole et al ( 1 985) Monoclonal Antibodies and Cancer Therapy . Alan R Liss Inc. New 
York NY, pp 77-96), 

In addition, techniques developed for the production of "chimeric antibodies", the splicing 
of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen 
specificity and biological activity can be used (Morrison et al ( 1 984) Proc Natl Acad Sci 

10 81:6851-6855: Neuberger et al (1984) Nature 312:604-608: Takeda et al (1985) Nature 

3 14:452-454), Alternatively, techniques described for the production of single chain antibodies 
(US Patent No. 4.946,778) can be adapted to produce SURM-specific single cham antibodies 

Antibodies may also be produced by inducing m vi vo production in the lymphocyte 
population or by screening recombinant immunogiobulin libraries or panels of highly specific 

15 bindmg reagents as disclosed in Orlandi et al (1989, Proc Natl Acad Sci 86:3833-3837), and 
Winter G and Milstcm C (1991; Nature 349:293-299). 

Antibody fragments which contain specific binding sites for SURH may also be 
generated. For example, such fragments include, but are not limited to. the F(ab')2 fragments 
which can be produced by pepsin digestion of the antibody molecule and the Fab fragments 

20 which can be generated by reducmg the disulfide bridges of the F(ab*)2 fragments. Alternatively, 
Fab expression libraries may be constructed to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity (Husc WD et al (1989) Science 256: 1 275- 128 1 ). 

A variety of protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the an. 

25 Such immunoassays typically involve the formation of complexes between SURH and its 
specific antibody and the measurement of complex formation. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two nonimerfenng epitopes on a 
specific SURFI protein is preferred, but a competitive binding assay may also be employed. 
These assays arc described in Maddox DE ei al (1983, J Exp Med 158:121 1). 

30 Diagnostic Assays Using SDRH Specific Antibodies 

Particular SURH antibodies are useful for the diagnosis of conditions or diseases 
characterized by expression of SURFI or in assays to monitor patients being treated with SURH, 
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agonists or inhibitors. Diagnosiic assays for SURH include methods utilizing the antibody and a 
label to detect SURH in human body fluids or extracts of cells or tissues. The polypeptides and 
antibodies of the present invention may be used with or without modification- Frequently, the 
polypeptides and antibodies will be labeled by joining them, either covalently or noncovaiently, 
5 with a reporter molecule. A wide variety of reporter molecules are known, several of which were 
described above. 

A variety of protocols for measuring SURH, using either polyclonal or monoclonal 
antibodies specific for the respective protein are known in the an. Examples include 
enzyme-iinked immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent 

10 activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on SURH is preferred, but a competitive 
binding assay may be employed. These assays are described, among other places, in Maddox, 
DE et al (1983. J Exp Med 158:121 1 ). 

In order to provide a basis for diagnosis, normal or standard values for SURH expression 

1 5 must be established. This is accomplished by combining body fluids or cell extracts taken from 
normal subjects, either animal or human, with antibody to SURH under conditions suitable for 
complex formation which are well known in the art. The amount of standard compiex formation 
may be quantified by comparing various anificiai membranes containing known quantities of 
SURH with both control and disease samples from biopsied tissues. Then, standard values 

20 obtained from normal samples may be compared with values obtained from samples from 
subjects potentially affected by disease. Deviation between standard and subject values 
establishes the presence of disease state. 
Drug Screening 

SURH. its catalytic or immunogenic fragments or oligopeptides thereof, can be used for 
25 screening therapeutic compounds in any of a variety of drug .screening techniques. The fragment 
employed in such a test may be free in solution, affixed to a solid support, borne on a ceil surface, 
or located intraceilularly. The formation of binding complexes, between SURH and the agent 
being tested, may be measured. 

Another technique for drug screening which may be used for high throughput screening of 
30 compounds having suitable bindmg affinity to the SURH is described in detail in ''Determination 
of Amino Acid Sequence Antigenicity'' by Geysen HN. WO Application 84/03564. published on 
September 13, 1984, and incorporated herein by reference. In summary', large numbers of 
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different small peptide test compounds are synthesized on a solid substrate, such as plastic pms or 
some other surface. The peptide test compounds are reacted with fragments of SURH and 
washed. Bound SURH is then detected by methods well kjiown in the art. Substantially purified 
SURH can also be coated directly onto plates for use in the aforementioned drug screening 

5 techniques. Alternatively, non-neutralizing antibodies can be used to capture the peptide and 
immobilize it on a solid support. 

This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of binding SURH specifically compete with a test compound for 
binding SURH. In this manner, the antibodies can be used to detect the presence of any peptide 

10 which shares one or more antigenic determinants with SURH. 
Uses of the Polynucleotide Encoding SURH 

A polynucleotide encoding SURH. or any part thereof, may be used for diagnostic and/or 
therapeutic purposes. For diagnostic purposes, the SURH of this invention may be used to detect 
£md quantitate gene expression in biopsied tissues in which expression of SURH may be 

1 5 implicated. The diagnostic assay is useful to distinguish between absence, presence, and excess 
expression of SURH and to monitor regulation of SURH levels during therapeutic inter\'ention. 
Included in the scope of the invention are oligonucleotide sequences, antisense RNA and DNA 
molecules, and PNAs. 

Another aspect of the subject invention is to provide for hybridization or PCR probes 

20 which are capable of detecting polynucleotide sequences, including genomic sequences, encoding 
SURH or closely related molecules. The specificity of the probe, whether it is made from a 
highly specific region, eg. 10 unique nucleotides in the 5' regulator)' region, or a less specific 
region, eg. especially in the 3* region, and the stringency of the hybridization or amplification 
(maximal, high, intermediate or low) will determine whether the probe identifies only naturally 

25 occurring SURIL alleles or related sequences. 

Probes may also be used for the detection of related sequences and should preferably 
contain at least 50% of the nucleotides from any of these SURH encoding sequences. The 
hybndization probes of the subject invention may be derived from the nucleotide sequence of 
SEQ ID N0:2 or from genomic sequence including promoter, enhancer elements and introns of 

30 the naturally occumng SURl-L Hybridization probes may be labeled by a variety of reponer 
groups, including radionuclides such as "-P or ""'S, or enTrymatic labels such as alkaline 
phosphatase coupled to the probe via aviditv'biotin coupling systems, and the like. 
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Other means for producing specific hybridization probes for SURU DN As include the 
cloning of nucleic acid sequences encoding SURH or SURJ-I derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art and are commercially available 
and may be used to synthesize RN A probes in vitrp by means of the addition of the appropriate 
5 RNA polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled 
nucleotides. 
Diagnostic Use 

Polynucleotide sequences encoding SURH may be used for the diagnosis of conditions or 
diseases with which the expression of SURH is associated. For example, polynucleotide 

10 sequences encoding SURH may be used in hybridization or PCR assays of fluids or tissues from 
biopsies to detect SURH expression. The form of such qualitative or quantitative methods may 
include Southern or northern analysis, dot blot or other membrane-based technologies; PCR 
technologies: dip stick, pin. chip and ELISA technologies. All of these techniques are well 
known in the an and are the basis of many commercially available diagnostic kits. 

15 The SURH nucleotide sequence disclosed herein provide the basis for assays that detect 

activation or induction associated with inflammation or disease. The SURH nucleotide sequence 
may be labeled by methods known in the art and added to a fluid or tissue sample from a patient 
under conditions suitable for the formation of hybridization complexes. After an incubation 
period, the sample is washed with a compatible fluid which optionally contains a dye (or other 

20 label requiring a developer) if the nucleotide has been labeled with an enzyme. After the 

compatible fluid is rinsed off, the dye is quantitated and compared with a standard. If the amount 
of dye in the biopsied or extracted sample is significantly elevated over that of a comparable 
control sample, the nucleotide sequence has hybridized with nucleotide sequences in the sample, 
and the presence of elevated levels of SURH nucleotide sequences in the sample indicates the 

25 presence of the associated inflammation and/or disease. 

Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment 
regime m animal studies, in clinical trials, or in monitoring the treatment of an individual patient. 
In order to provide a basis for the diagnosis of disease, a normal or standard profile for SURH 
expression must be established. This is accomplished by combining body fluids or cell extracts 

30 taken from normal subjects, cither animal or human, with SURl-I, or a portion thereof under 
conditions suitable for hybridization or amplirlcation. Standard hybridization may be quantified 
by comparing the values obtained for normal subjccis with a dilution series of SURH run in the 
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same experiment where a known amount of substantially purified SURH is used. Standard 
values obtained from normal samples may be compared with values obtained from samples from 
patients afflicted with SURH-associated diseases. Deviation between standard and subject values 
is used to establish the presence of disease. 
5 Once disease is established, a therapeutic agent is administered and a treatment profile is 

generated. Such assays may be repeated on a regular basis to evaluate whether the values in the 
profile progress toward or return to the normal or standard pattern. Successive treatment profiles 
may be used to show the efficacy of treatment over a period of several days or several months. 
Polymerase Chain Reaction (PGR) as described in US Patent Nos. 4.683.195 and 

10 4,965,188 provides additional uses for oligonucleotides based upon the SURl-l sequence. Such 
oligomers are generally chemically synthesized, but they may be generated enz>'matically or 
produced from a recombinant source. Oligomers generally comprise two nucleotide sequences, 
one with sense orientation (5'->3') and one with antisense (3'<-5*), employed under optimized 
conditions for identification of a specific gene or condition. The same two oligomers, nested sets 

15 of oligomers, or even a degenerate pool of oligomers may be employed under less stringent 
conditions for detection and/or quantitation of closely related DNA or RNA sequences. 

Additionally, methods which may be used to quantitate the expression of a particular 
molecule include radiolabeimg (Melby PC ei al 1993 J Immunol Methods 159:235*44) or 
biotinylating (Dupiaa C et al 1993 Anal Biochem 229-36) nucleotides, coamplification of a 

20 control nucleic acid, and standard curves onto which the experimental results are interpolated. 
Quantitation of multiple samples may be speeded up by runnmg the assay in an ELISA format 
where the oligomer of interest is presented in various dilutions and a spectrophotometric or 
colorimeinc response gives rapid quantitation. A definitive diagnosis of this type may allow 
health professionals to begin aggressive treatment and prevent further worsening of the condition. 

25 Similarly, further assays can be used to monitor the progress of a patient dunng treatment. 
Furthermore, the nucleotide sequences disclosed herein may be used in molecular biology 
techniques that have not yet been developed, provided the new techniques rely on properties of 
nucleotide sequences that are currently known such as the triplet genetic code, specific base pair 
interactions, and the like. 

30 Therapeutic Use 

Based upon its homology to the genes encodmg SURs and its expression profile, the 
SURH polynucleotide disclosed herein ma> provide the basis for the design of molecules for the 
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treatment of diseases such as NIDDM. PHHL and other diseases associated with the SUR and/or 
the K^Tp channel. 

Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or 
from various bacterial plasmids, may be used for delivery- of nucleotide sequences to the targeted 
5 organ, tissue or ceil population. Methods which are well known to those skilled in the an can be 
used to construct recombinant vectors which will express antisense SURH. See, for example, the 
techniques described in Sambrook et al (supra) and Ausubel et al (supra). 

The polynucleotides comprising full length cDNA sequence and/or us regulators- 
elements enable researchers to use SURH as an investigative tool in sense (Youssoufian H and 
10 HFLodish 1993 Mol Cell Biol 13:98-104) or antisense (Eguchi et al (1991) Annu Rev Biochem 
60:63 1-652) regulation of gene function. Such technology is now well known in the an. and 
sense or antisense oligomers, or larger fragments, can be designed from various locations along 
the coding or control regions. 

Genes encoding SURH can be turned off by transfecting a cell or tissue with expression 
1 5 vectors which express high levels of a desired SURH nucleotide fragment. Such constructs can 
flood ceils with untranslatable sense or antisense sequences. Even in the absence of integration 
into the DNA. such vectors may continue to transcribe RNA molecules until all copies are 
disabled by endogenous nucleases. Transient expression may last for a month or more with a 
non-replicating vector (Mettler L personal communication) and even longer if appropriate 
20 replication elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
antisense molecules, DNA. RNA or PNA, to the control regions of SURH. ie. the promoters, 
enhancers, and introns. Oligonucleotides derived from the transcription initiation site, eg, 
between -10 and -^10 regions of the leader sequence, are preferred. The antisense molecules may 
25 also be designed to block translation of mRNA by preventing the transcript from binding to 
ribosomes. Similarly, inhibition can be achieved using "triple helix" base-pairing methodology. 
Triple helix pairing compromises the ability of the double helix to open sufficiently for the 
binding of polymerases, transcription factors, or regulator)' molecules. Recent therapeutic 
advances using triplex DNA were reviewed by Gee JE et al (In: Huber BE and Bl Carr ( 1 994) 
30 Molecular and Immunoiopic Approaches . Futura Publishing Co. Mt Kisco NY). 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of riboz>'me action involves sequence-specific hybridization of the 
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ribozyme molecule to complementary target RNA. followed by endonucleolytic cleavage. 
Within the scope of the invention are engineered hammerhead motif ribozyme molecules that can 
specifically and efficiently catalyze endonucleolytic cleavage of RNA encoding SURH. 

Specific ribozyme cleavage sites within any potential RNA target are iniiially identified 
5 by scanning the target molecule for ribozyme cleavage sites which include the following 

sequences, GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may be 
evaluated for secondary structural features which may render the oligonucleotide inoperable. The 
suitability of candidate targets may also be evaluated by testing accessibility to hybridization 

10 with complementar>^ oligonucleotides using ribonuclease protection assays. 

Antisense molecules and ribozymes of the invention may be prepared by any method 
known in the art for the synthesis of RNA molecules. These include techniques for chemically 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 

1 5 sequences encoding SURH. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, antisense 
cDNA constructs that synthesize antisense RNA constitutively or inducibly can be introduced 
into cell lines, cells or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 

20 modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule or the use of phosphorothioate or 2' ()-meihyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of 
PNAs and can be extended in ail of these molecules by the inclusion of nontraditional bases such 
as inosine. queosine and wybutosine as well as acetyl-, methyl-, ihio- and similarly modified 

25 forms of adenine. c>iidine. guanine, thymine, and uridine which are not as easily recognized by 
endogenous cndonucleases. 

Methods for introducing vectors into cells or tissues include those methods discussed 
infra and which are equally suitable for in vivo , in vitro and cx vivo therapy, For ^ vivo 
therapy, vectors are introduced into stem cells taken from the patient and clonally propagated for 

30 autologous transplant back into that same patient is presented in US Patent Nos. 5,399.493 and 
5,437,994. disclosed herein by reference. Dehverv' by transfection and by liposome are quite 
well known in the art. 
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Funhermore. the nucleotide sequences for SURH disclosed herein may be used in 
molecular biology techniques that have not yet been developed, provided the new techniques rely 
on properties of nucleotide sequences that are currently known, including but not limited to such 
properties as the triplet genetic code and specific base pair interactions. 
5 Detection and Mapping of Related Polynucleotide Sequences 

The nucleic acid sequence for SURH can also be used to generate hybridization probes 
for mapping the naturally occurring genomic sequence. The sequence may be mapped to a 
particular chromosome or to a specific region of the chromosome using well known techniques. 
These include in situ hybridization to chromosomal spreads, flow-sorted chromosomal 
10 preparations, or artificial chromosome constructions such as yeast artificial chromosomes, 
bacterial artificial chromosomes, bacterial PI constructions or single chromosome cDNA 
libraries as reviewed in Price CM (1993: Blood Rev 7:127-34) and Trask BJ f I99I: Trends Genet 
7:149-54). 

The technique of fluorescent in sina hybridization of chromosome spreads has been 
1 5 described, among other places, in Verma et ai (1 988) Human Chromosomes : A Manual of Bagic 
Techniques . Pergamon Press. New York NY. Fluorescent in situ hybridization of chromosomal 
preparations and other physical chromosome mapping techniques may be correlated with 
additional genetic map data. Examples of genetic map data can be found in the 1994 Genome 
Issue of Science (265:19810. Correlation between the location of a SURH on a physical 
20 chromosomal map and a specific disease {or predisposition to a specific disease) may help 

delimit the region of DNA associated with that genetic disease. The nucleotide sequences of the 
subject invention may be used to detect differences in gene sequences between normah carrier or 
affected individuals. 

Is hybridization of chromosomal preparations and physical mapping techniques such 
25 as linkage analysis using established chromosomal markers may be used for extending genetic 
maps. For example, an STS based map of the human genome was recently published by the 
Whiiehead-MIT Center for Genomic Research (Hudson TJ et al (1995 ) Science 270:1945-1954). 
Often the placement of a gene on the chromosome of another mammalian species such as mouse 
(Whitehead Institute/MIT Center for Genome Research. Genetic Map of the Mouse. Database 
30 Release 1 0. April 28. 1995) may reveal associated markers even if the number or arm of a 
particular human chromosome is not known. Ncw^ sequences can be assigned to chromosomal 
arms, or parts thereof by physical mappme. This provides valuable information to investigators 
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searching for disease genes using positional cloning or other gene discover}^ techniques. Once a 
disease or syndrome, such as ataxia telangiectasia (AT), has been crudely localized by genetic 
linkage to a particular genomic region, for example. AT to 1 lq22-23 (Gatti ei al (1988) Nature 
336:577-580), any sequences mapping to that area may represent associated or regulatory genes 

5 for further investigation. The nucleotide sequence of the subject invention may also be used to 
detect differences in the chromosomal location due to translocation, inversion, etc. among 
normal, carrier or affected individuals. 
Pharmaceutical Compositions 

The present invention relates to pharmaceutical compositions which may comprise 

10 nucleotides, proteins, antibodies, agonists, antagonists, or inhibitors, alone or in combination 
with at least one other agent, such as stabilizing compound, which may be administered in any 
sterile, biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered saline 
dextrose, and water. Any of these molecules can be administered to a patient alone, or in 
combination with other agents, drugs or hormones, in pharmaceutical compositions where it is 

1 5 mixed with excipient(s) or pharmaceutically acceptable carriers. In one embodiment of the 
present invention, the pharmaceutically acceptable carrier is pharmaceutically inert. 
Administration of Pharmaceutical Compositions 

Administration of pharmaceutical compositions is accomplished orally or parenterally. 
Methods of parenteral delivers' include topical, intra-anenai (directly to the tumor), 

20 intramuscular, subcutaneous, intramedullary', intrathecal, intraventricular, intravenous, 
intraperitoneal, or intranasal administration. In addition to the active ingredients, these 
pharmaceutical compositions may contain suitable pharmaceutically acceptable carriers 
comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. Further details on techniques for formulation 

25 and administration may be found in the latest edition of "Remmgion's Pharmaceutical Sciences" 
(Maack Publishing Co, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the an in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets 

30 pills, dragees. capsules, liquids, gels, syrups, slurries, suspensions and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combination of active 
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compounds with solid excipient, optionally grinding a resulting mixture, and processing the 
mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. 
Suitable excipients are carbohydrate or protein fillers such as sugars, including lactose, sucrose, 
mannitol, or sorbitol: starch from com. wheat, rice, potato, or other plants; cellulose such as 
5 methyl cellulose, hydroxypropylmethyi-cellulose. or sodium carboxymethylcelluiosc: and gums 
including arabic and tragacanth; and proteins such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl 
pyrroiidone. agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as concentrated sugar solutions. 

10 which may also contain gum arabic. talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent 
mixtures. Dyesiuffs or pigments may be added to the tablets or dragee coatings for product 
identification or to characierize the quantity of active compound, ie, dosage. 

Pharmaceutical preparations which can be used orally include push-fu capsules made of 

15 gelatin, as well as soft, sealed capsules made of gelatin and a coating such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with a filler or binders such as lactose or 
starches, lubricants such as talc or magnesium stearate. and. optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatT\' 
oils, liquid paraffin, or liquid polyethylene glycol with or without stabilizers. 

20 Pharmaceutical formulations for parenteral administration include aqueous solutions of 

active compounds. For injection, the pharmaceutical compositions of the invention may be 
formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks's solution. Ringer's solution, or physiologically buffered saline. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension, such as 

25 sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fany oils such as sesame oil. or synthetic fatty acid esters, such as 
ethyl oieate or triglycerides, or liposomes. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the compounds to allow for the preparation 

30 of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the panicular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the an. 
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Manufacture and Storage 

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that known in the an, eg, by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. 

5 The pharmaceutical composition may be provided as a salt and can be formed with many 

acids, including but not limited to hydrochlorie. sulfuric, acetic, lactic, tartaric, malic, succinic, 
etc. Salts tend to be more soluble in aqueous or other protonic solvents that are the 
corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder in 1 mN4-50 mM histidine. 0,l%-2% sucrose, 2%-7% mannitol at a pH range of 4.5 to 5.5 

1 0 that is combined with buffer pnor to use. 

After pharmaceutical compositions comprising a compound of the invention formulated 
in a acceptable carrier have been prepared, they can be placed in an appropriate container and 
labeled for treatment of an indicated condition. For administration of SURil, such labeling 
would include amount, frequency and method of administration. 

15 Therapeuticaliv Effective Dose 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve the 
intended purpose. The determination of an effective dose is well within the capability of those 
skilled in the art. 

20 For any compound, the therapeutically effective dose can be estimated initially either in 

ceil culture assays, eg, of neoplastic cells, or in animal models, usually mice, rabbits, dogs, or 
pfgs. The animal model is also used to achieve a desirable concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

25 A therapeutically effective dose refers to that amount of protein or its antibodies. 

antagonists, or inhibitors which ameliorate the symptoms or condition. Therapeutic efficacy and 
toxicity of such compounds can be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, eg, ED50 (the dose therapeutically effective in 50% of the 
population) and LD50 (the dose lethal to 50% of the population). The dose ratio between 

30 therapeutic and toxic effects is the therapeutic index, and it can be expressed as the ratio, 

LD50/ED50. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. 
The data obtained from cell culture assays and animal studies is used in formulating a range of 
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dosage for human use. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage varies 
within this range depending upon the dosage form employed, sensitivity of the patient, and the 
route of administration. 
5 The exact dosage is chosen by the individual physician in view of the patient to be 

treated. Dosage and administration are adjusted to provide sufficient levels of the active moiety 
or to maintain the desired effect. Additional factors which may be taken into account include the 
severity of the disease state, eg. tumor size and location; age. weight and gender of the patient; 
diet, time and frequency of administration, drug combination(s). reacdon sensitivities, and 

10 tolerance/response to therapy. Long acting pharmaceutical compositions might be administered 
ever>' 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate 
of the particular formulation. 

Normal dosage amounts may var\^ from 0. 1 to 100.000 micrograms, up to a total dose of 
about 1 g, depending upon the route of administration. Guidance as to particular dosages and 

15 methods of deliver}- is provided in the Hteraiure. See US Patent Nos. 4.657.760; 5.206.344: or 
5,225.212. Those skilled in the art will employ different formulations for nucleotides than for 
proteins or their inhibitors. Similarly, delivery of polynucleotides or polypeptides will be 
specific to particular cells, conditions, locations, etc. 

The examples below are provided to illustrate the subject mvention and are not included 

20 for the purpose of limiting the invention. 

INDUSTRIAL APPLICABILITY 
I cDN.A Librar\' Construction 

The BRAINOT03 cDNA library was constructed from normal brain tissue removed from 
a 26 year old male (lot #0003; Mayo Clinic, Rochester MN). The frozen tissue was homogenized 

25 and lysed using a Brinkmann Homogenizer Polytron PT-3000 (Brinkmann Instruments, 

Westbur)' NJ). The reagents and extraction procedures were used as supplied in the Straiagene 
RNA Isolation Kit (Cat. # 200345: Stratagene). The lysate was centriftiged over a 5.7 M CsCl 
cushion using an Beckman SW28 rotor in a Beckman L8-70M Ultracentrifuge (Beckman 
Instruments) for 1 8 hours at 25.000 rpm at ambient temperature. The RNA was extracted once 

30 with phenol chloroform pH 8.0. once with acid phenol pH 4.0. precipitated using 0.3 M sodium 
acetate and 2.5 volumes of ethanoL resuspended in water and DNase treated for 1 5 min at 37°C. 
The RNA was isolated using the Qiagen Oligoiex kit (QI AGEN Inc. Chatsworth CA) and used to 
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construct the cDNA ltbrar>'. 

The RNA was handled according to the recommended protocols in the Superscript 
Plasmid System for cDNA Synthesis and Plasmid Clonmg (Cat. #18248-013: Gibco/BRL). 
cDNAs were fraciionated on a Sepharose CL4B coiumn (Cat. #275105. Pharmacia), and those 
5 cDNAs exceeding 400 bp were ligated into pSport I. The plasmid pSport I was subsequently 
transformed into DH5a™ competent cells (Cat #18258-012, Gibco/BRL). 

II Isolation and Sequencing of cDNA Clones 

Plasmid DNA was released from the cells and purified using the Miniprep Kit (Cat. # 
77468; Advanced Genetic Technologies Corporation. Gaithersburg MD). This kit consists of a 

10 96 well block with reagents for 960 purifications. The recommended protocol was employed 
except for the following changes: 1) the 96 wells were each filled with only 1 ml of sterile 
Terrific Broth (Cat. # 2271 1 . LIFE TECHNOLOGIEStm, Gaithersburg MD) with carbeniciilin at 
25 mg/L and glycerol at 0.4%: 2) the bacteria were cultured for 24 hours after the wells were 
inoculated and then lysed with 60 ul of lysis buffer: 3) a centrifugation step employing the 

1 5 Beckman GS-6R fa2900 rpm for 5 min was performed before the contents of the block were 
added to the primary filter plate: and 4) the optional step of adding isopropanol to TRIS buffer 
was not routinely performed. After the last step in the protocol, samples were transferred to a 
Beckman 96-well block for storage. 

The cDNAs were sequenced by the method of Sanger \ and AR Coulson (1975: J Mol 

20 Biol 94:441 f). using a Hamilton Micro Lab 2200 (Hamilton. Reno NV) in combination with four 
Peltier Thermal Cyclers (PTC200: MJ Research, Wateriovvn MA) and Applied Biosystems 377 
DNA Sequencing Systems (Perkin Elmer), and reading frame was determined. 

III Homologj' Searching of cDNA Clones and Their Deduced Proteins 

Each cDNA was compared to sequences in GenBank using a search algorithm developed 
25 by Applied Biosystems and incorporated into the INHERIT- 670 Sequence Analysis System. In 
this algorithm. Pattern Specification Language (TRW Inc. Los Angeles C A) was used to 
determine regions of homology. The three parameters that determine how the sequence 
comparisons run were window size, window offset, and error tolerance. Using a combination of 
these three parameters, the DNA database was searched for sequences containing regions of 
30 homology to the query sequence, and the appropriate sequences were scored with an initial value. 
Subsequently, these homologous regions were examined using dot matrix homology plots to 
distinguish regions of homology from chance matches. Smith- Waterman alignments were used 
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to display the results of the homoiogy search. 

Peptide and protein sequence homologies were ascertained using the INHERIT''' 670 
Sequence .Ajiaiysis System in a way similar to that used in DNA sequence homologies, Pattern 
Specification Language and parameter windows were used to search protein databases for 
5 sequences containing regions of homology which were scored with an initial value. Dot-matrix 
homology plots were examined to distinguish regions of significant homology from chance 
matches. 

BLAST, which stands for Basic Local Alignment Search Tool (Altschul SF (1993) J Mol 
Evol 36:290-300: Altschul. SF et al (1990) J Mol Biol 21 5:403-10), was used to search for local 

10 sequence alignments. BLAST produces alignments of both nucleotide and amino acid sequences 
to determme sequence similarity. Because of the local nature of the alignments. BLAST is 
especially useful in deierminmg exact matches or in identifying homologs. BLAST is useful for 
matches which do not contain gaps. The fundamental unit of BLAST algorithm output is the 
High-scoring Segment Pair (HSP). 

15 An HSP consists of two sequence fragments of arbitrary but equal lengths whose 

alignment is locally maximal and for which the alignment score meets or exceeds a threshold or 
cutoff score set by the user. The BLAST approach is to look for HSPs between a query sequence 
and a databa.se sequence, to evaluate the statistical significance of any matches found, and to 
report only those matches which satisfy the user-selected threshold of significance. The 

20 parameter E establishes the statistically significant threshold for reporting database sequence 
matches. E is interpreted as the upper bound of the expected frequency of chance occurrence of 
an HSP (or set of HSPs) within the context of the entire database search. Any database sequence 
whose match satisfies E is reported in the program output. 
IV Northern Analysis 

25 Northem analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
RNAs from a particular cell type or tissue have been bound (Sambrook et al. supra). 

.Analogous computer techniques use BLAST (Altschul SF 1993 and 1990. supra) to 
search for identical or related molecules in nucleotide databases such as GenBank or the 

30 LIFESEQ"^^ database (Incyte, Palo Alto CA). This analysis is much faster than mukipie, 
membrane-based hybridizations. In addition, the sensitivity of the computer search can be 
modified to determine whether any particular n-.atch is categorized as exact or homologous. 
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The basis of the search is the product score which is defined as: 

% sequence identity x % maximum BLAST score 
100 

and it takes into account both the degree of similarity between two sequences and the length of 
5 the sequence match. For example, with a product score of 40, the match will be exact within a 1- 
2% error; and at 70. the match will be exact. Homologous molecules are usually identified by 
selecting those which show product scores between 15 and 40, although lower scores may 
identify related molecules. 

The results of the search are reported as a list of libraries in which the SURH encoding 
10 sequence occurs. Abundance and percentage abundance of the SURH encoding sequence are 
also reported. Abundance directly reflects the number of times a particular transcript is 
represented in a cDNA librar\\ and percent abundance is abundance divided by the total number 
of sequences examined in the cDNA libran-. 

V Extension of SURH to FuU Length or to Recover Regulatory Elements 

15 The nucleic acid sequence encoding SURH (SEQ ID N0:2) is used to design 

oligonucleotide primers for extending a partial nucleotide sequence to full length or for obtaining 
5' sequences from genomic libraries. One primer is synthesized to initiate extension in the 
antisense direction (XLR) and the other is synthesized to extend sequence in the sense direction 
(XLF). Primers allow the extension of the known SURH nucleotide sequence "outward" 

20 generating amplicons containing new. unknown nucleotide sequence for the region of interest 
(US Patent Apphcaiion 08/487.1 12. filed June 7. 1995. spccificaiiy incorporated by reference). 
The initial primers are designed from the cDNA using OLIGC"' 4.06 Primer Analysis Software 
(National Biosciences), or another appropriate program, to be 22-30 nucleotides in length, to 
have a GC content of 50% or more, and to anneal to the target sequence at temperatures about 

25 68°-72'' C. Any stretch of nucleotides which would result in hairpin structures and primer- 
primer dimerizations is avoided. 

The original, selected cDNA libraries, or a human genomic librar\' arc used to extend the 
sequence; the latter is most useful to obtain 5' upstream regions. If more extension is necessary 
or desired, additional sets of primers are designed to further extend the known region. 

30 By following the instructions for the XL-PCR kit (Perkin Elmer) and thoroughly mixing 

the enz\'me and reaction mix. high fidelity aa:Dlincation is obtained. Beginning with 40 pmoi of 
each primer and the recommended concentration^ of all other components of the kit. PGR is 
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perlbrmed using the Peltier Thermal Cycler (PTC200: MJ Research. Watenown MA) and the 

following parameters: 

Step 1 94° C for 1 min (initial denaturation) 

Step 2 65°Cfor 1 min 

5 Steps 68° C for 6 min 

Step 4 94° C for 15 sec 

Step 5 65'' C for 1 min 

Step 6 68° C for 7 min 

Step 7 Repeat step 4-6 for 15 additional cycles 

10 Steps 94° C for 15 sec 

Step 9 65° C for 1 min 

Step 10 68° C for 7:15 min 

Step 1 1 Repeat step 8-10 for 12 cycles 

Step 12 72° C for 8 min 

15 Step 13 4° C (and holding) 

A 5-10 ul aliquot of the reaction mixture is analyzed by electrophoresis on a low 
concentration (about 0.6-0.8%) agarose mini-gel to determine which reactions were successful in 
extending (he sequence. Bands thought to contain the largest products were selected and cut out 

20 of the gel. Further purification involves using a commercial gel extraction method such as 
QIAQuick"'"'^ (QIAGEN Inc). After recovery of the DNA. Klenow enzyme was used to trim 
single-stranded, nucleotide overhangs creating blunt ends which facilitate religation and cloning. 

After ethanol precipitation, the products are redissolved in 13 ul of ligation buffer. \u\ 
T4-DNA ligase (15 units) and l ul T4 polynucleotide kinase are added, and the mixture is 

25 incubated at room temperature for 2-3 hours or overnight at 16° C\ Competent F. coli cells fin 
40 u\ of appropriate media) are transformed with 3 u\ of ligation mixture and cultured in 80 u[ of 
SOC medium (Sambrook J et al. supra). After incubation for one hour at 37"" C, the whole 
transformation mixture is plated on Luria Bertani (LB)-agar (Sambrook J ct aL supra) containing 
2xCarb. The following day, several colonies are randomly picked from each plate and cultured in 

30 150 u\ of liquid LB/2xCarb medium placed in an individual well of an appropriate, 

commercially-available, sterile 96-well microtiter plate. The following day, 5 ^t! of each 
overnight culture is transferred into a non-sterile 96'well plate and after dilution 1:10 with water, 
5 ^\ of each sample is transferred into a PCR array. 

For PCR amplification, 18 ,u\ of concentrated PCR reaction mix (3,3x) containing 4 units 

35 of rTth DNA polymerase, a vector primer and one or both of the gene specific primers used for 
the extension reaction are added to each well. Amplification is performed using the following 
conditions: 
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Step 1 
Step 2 
Steps 
Step 4 
Step 5 
Step 6 



94" C for 60 sec 
94 C for 20 sec 
55^ C for 30 sec 
72^ C for 90 sec 



5 



Repeat steps 2-4 for an additional 29 cycles 
72° C for 180 sec 



Step 7 4° C (and holding) 

Aiiquots of the PCR reactions are run on agarose gels together with molecular weight 
markers. The sizes of the PCR products are compared to the original partial cDNAs. and 
10 appropriate clones are selected, ligated into plasmid and sequenced. 
VI Labeling and Use of Hybridization Probes 

Hybridization probes derived from SEQ ID NO:2 are employed to screen cDNAs. 
genomic DNAs or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 
base-pairs, is specifically described, essentially the same procedure is used with larger cDNA 

15 fragments- Oligonucleotides are designed using state-of-ihe-ari software such as OLIGO 4.06 
(National Biosciences), labeled by combining 50 pmol of each oligomer and 250 mCi of [y-'^P] 
adenosine triphosphate (Amersham. Chicago ID and T4 polynucleotide kinase (DuPont NEN*. 
Boston MA). The labeled oligonucleotides arc substantially purified with Sephadex G-25 super 
fine resin column (Pharmacia). A ponion contaming 10' counts per minute of each of the sense 

20 and antisense oligonucleotides is used in a typical membrane based hybridization analysis of 
human genomic DNA digested with one of the following endonucieases (Ase 1, Bgl 11. Eco Rl, 
Psi L Xba 1 . or Pvu II: DuPom NEN^). 

The DNA from each digest is fractionated on a 0.7 percent agarose gel and transferred to 
nylon membranes (Tsyiran Plus. Schleicher &. Schucil. Durham Nl I), Hybridization is carried out 

25 for 16 hours at 40 ^C. To remove nonspecific signals, blots are sequentially washed at room 
temperature under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% 
sodium dodecyl sulfate. After XOMAT AR™ film (Kodak. Rochester NY') is exposed to the 
blots in a Phosphoimager cassette (Molecular Dynamics, Sunnyvale CA) for several hours, 
hybridization patterns are compared visually. 

30 VII Antisense Molecules 

The nucleotide sequence encoding SURH, or any part thereof is used to inhibit in vivo or 
in vitro expression of naturally occurring SURil. Although use of antisense oligonucleotides, 
comprising about 20 base-pairs, is specifically described, essentiaily the same procedure is used 
with larger cDNA fragments. An oligonucleotide based on the coding sequence of SURH as 
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shown in Figures 1 A-IM is used lo inhibit expression of naturally occurring SURl-I. The 
complememar\' oHgonucieotide is designed from the most unique 5' sequence as shown in 
Figures 1 A-IM and used cither to inhibit transcription by preventing promoter binding to the 
upstream nontranslaied sequence or translation of an SURH transcript by preventing the 
5 ribosome from binding. Using an appropriate portion of the leader and 5' sequence of SEQ ID 
N0:2, an effective antisense oligonucleotide includes any 15t20 nucleotides spanning the region 
which translates into the signal or early coding sequence of the polypeptide as shown in Figures 
lA-lM. 

Vin Expression of SURH 

10 Expression of SURII may be accomplished by subcloning the cDNAs into appropriate 

expression vectors and transfecting the vectors into analogous expression hosts. In this case, the 
cloning vector . pSpon. previously used for the generation of the cDNA librar>' also provides for 
direct expression of SURH sequences in LI. coli . Upstream of the cloning site, this vector 
contains a promoter for B-galactosidase. followed by sequence containing the amino-terminal 

1 5 Met and the subsequent 7 residues of B-galactosidase. immediately following these eight 

residues is an engineered bacteriophage promoter useful for artificial priming and transcription 
and a number of unique restriction sites, including Eco RJ, for cloning. 

Induction of the isolated, iransfected bacterial strain with IPTG using standard methods 
will produce a fusion protein corresponding to the first seven residues of B-galactosidase. about 

20 15 residues of "linker", and the peptide encoded within the cDNA. Since cDNA clone inserts are 
generated by an essentially random process, there is one chance in three that the included cDNA 
will lie in the correct frame for proper translation, [f the cDNA is not in the proper reading 
frame, it can be obtained by deletion or insertion of the appropriate number of bases by well 
known methods including in vitro mutagenesis, digestion with exonuclease III or mung bean 

25 nuclease, or the inclusion of an oligonucleotide linker of appropriate length. 

The SURH cDNA can be shuttled into other vectors known to be useful for expression of 
protein in specific hosts. Oligonucleotide primers containing cloning sites as well as a segment 
of DNA (about 25 bases) sufficient to hybridize to stretches at both ends of the target cDNA can 
be synthesized chemicaiiy by standard methods. These primers can then used to amplify the 

30 desired gene segment by PCR. The resulting gene segment can be digested with appropriate 
restriction enzymes under standard conditions and isolated by gei electrophoresis. Alternately, 
similar gene segments can be produced by digestion of the cDNA with appropriate restriction 
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enzymes. Using appropriate primers, segments of coding sequence from more than one gene can 
be ligated together and cloned in appropriate vectors. U is possible to optimize expression by 
construction of such chimeric sequences. 

Suitable expression tiosts for such chimeric molecules include, but are not limited to. 
5 mammalian cells such as Chinese Hamster Ovar>' (CHO) and human 293 cells, insect cells such 
as Sf9 cells, yeast cells such as Saccharomvces cerevisiae . and bacteria such as £aJi. For each 
of these cell systems, a useful expression vector may also include an origin of replication to allow 
propagation in bacteria and a selectable marker such as the B-laciamase antibiotic resistance gene 
to allow plasmid selection in bacteria. In addition, the vector may include a second selectable 

10 marker such as the neomycin phosphotransferase gene to allow selection in transfected 
eukaryotic host cells. Vectors for use in eukaryotic expression hosts may require RNA 
processing elements such as 3' poiyadenylation sequences if such are not pan of the cDNA of 
interest- 
Additionally, the vector may contain promoters or enhancers which increase gene 

15 expression. Such promoters are host specific and include MMTV. SV40, and metallothionine 
promoters for CHO cells; trp, lac. tac and T7 promoters for bacterial hosts; and alpha factor, 
alcohol oxidase and PGH promoters for yeast. Transcription enhancers, such as the rous sarcoma 
virus enhancer, may be used in mammalian host cells. Homogeneous cultures of recombinant 
cells are obtained through standard culture methods. Cellular fractions from cells containing 

20 SURH are prepared by solubilization of the whole cell and isolation of subcellular fractions by 
differential centrifugaiion, by the addition of detergent, or by other methods well known in the 
art. These fractions can be used directly in the following assay. 
IX SURH Activity 

The SU binding activity of SURH or biologically active fragments thereof may be 

25 assayed in a competitive binding assay. The competitive binding of 5-['--I]iodo-2- 

hydroxyglyburide (-T-HGB: Nelson et al (1992) J Biol Chcm 267:14928-14933) and an 
unlabeled SU to SURH is measured by subsequent UV-crosslinkine of bound '-'l-HGB to the 
protein, Detergeni-solubilized or membrane-bound SURH. or soluble fragments of SURH. arc 
incubated with var>'ing concentrations of unlabeled SU plus a predetermined concentration of 

30 '"1-HGB until equilibrium is reached. Aliquots are irradiated at 312 nm to cross-link SURH- 
bound I-HGB to the protein. The irradiated protein samples are electrophoresed on SDS- 
polyacr>'lamide geis. The gels arc dried and subjected to autoradiography. Bands corresponding 
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to -labeled SURH are excised from the dried gels, and the radioactivity quantitated in a 
gamma radiation counter. Data obtained using different concentrations of unlabeled SUs are 
used to calculate values for the number, affinity, and association of SURJ-I with the candidate SU 
ligands using an equation such as presented in Nelson et al (supra). 
5 X Production of SURH Specific Antibodies 

SURH substantially purified using PAGE electrophoresis (Sambrook. supra) is used to 
immunize rabbits and to produce antibodies using standard protocols. 7*he ammo acid sequence 
translated from SURH is analyzed using DNAStar software (DNASiar Inc) to determine regions 
of high immunogenicity and a corresponding oligopolypeptide is synthesized and used to raise 
1 0 antibodies by means known to those of skill in the an. Analysis to select appropriate epitopes, 
such as those near the C-terminus or in hydrophilic regions is described by Ausubel FM et al 
(supra). 

Typically, the oligopeptides are 15 residues in length, synthesized using an Applied 
Biosystems Peptide Synthesizer Model 43 1 A using fmoc-chemistr>'. and coupled to keyhole 
15 limpet hemocyanin (KLH. Sigma) by reaction with M-maleimidobenzoyl-N-hydroxysuccinimide 
ester (MBS; Ausubel FM et al, supra). Rabbits are immunized with the oligopeptide-{CLH 
complex in complete Freund's adjuvant. The resulting aniisera are tested for antipeptide activity, 
for example, by binding the peptide to plastic, blocking with 1% BSA. reacting with rabbit 
antisera. washing, and reacting with radioiodinated, goat anti-rabbit IgG. 
20 XI Purification of Naturally Occurring SURH Using Specific Antibodies 

Naturally occurring or recombinant SURH is substantially purified by immunoaffmit) 
chromatography using antibodies specific for SURH. An immunoaffinity column is constructed 
by covalently coupling SURH antibody to an activated chromatographic resin such as 
CnBr-activated Sepharose (Pharmacia Biotech). After the coupling, the resin is blocked and 
25 washed according to the manufacturer's instructions. 

Cellular fractions from cells containing SURH are prepared by solubilization of the whole 
cell and isolation of subcellular fractions by differential centrifugation, by the addition of 
detergent, or by other methods well know^n in the art, 

A fractionated SURH-containing preparation is passed over the immunoaffinity column. 
30 and the column is washed under conditions that allow the preferential absorbance of SURH (eg, 
high ionic strength buffers in the presence of detergent). The column is eiutcd under conditions 
that disrupt antibody/SURH binding (eg. a buffer of pH 2-3 or a high concentration of a 
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chaoirope such as urea or thiocyanate ion), and SURH is collected. 

All publications and patents mentioned in the above specification are herein incorporated 
by reference. Various modifications and variations of the described method and system of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit 
5 of the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 
to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
intended to be within the scope of the following claims. 

10 
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SEQUENCE LISTING 

{!) GENERAL IWFORMATION 

(i; AFPLICAMT: INCYTE PHARMACEUTICALS, INC. 

TITLE OE THE INVENTION: NOVEL HUMAN SULFONYLUREA ?,ECEPTOR 

till) NUM3ER OF SEQUENCES: 5 

(iv; CORRESPONDENCE ADDRESS: 

;A) ADDRESSEE: Incyre Pharmaceut Jicais , Inc. 
IB) STREET: 317^ Porter Drive 
{€) CITY: Palo Aico 
;D) STATE: CA 
:Z] COUNTRY: U.S- 
ZIP: 94304 

{v; COMPUTER READABLE FORJ^ : 
:A) MEDIUM TYPE: Oiskette 
■:B) COMPUTER; IBM Comoatiibie 
■Z) OPEPA-TIKG SYSTEM; DOS 
■:D) SOFTWARE: EastSFQ Version 1.5 

(vl; CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE; fiied Herewith 

(vii) PRIOR APPLICATION DATA: 

;A) APPLICATION NUMBER: US 08/726,320 
;3; FILING DATE: 02-OCT-1996 

(Viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Biliinqs, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

;C) REFERENCE/DOCKET NUMBER; AF-0001 PCT 

(i:-:; TELECOMMUNICATION INFORMATION: 
:A; TELEPHONE: 6 5 0-855-0555 
: 3 ; TELE F.A'--' : 650-645i-4166 

(2) i::rOR[-lATION for SEQ id N0:1; 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15S0 ammo acids 
;B) TYPE: amino acid 
■:C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

:ii) MOLECULE TYPE; pepcide 

(vii) IMMEDIATE SOURCE: 
(A) LIBRARY; 
{3' CLONE: Consensus 

i;-:.) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

t Pro Leu Ai^ Pne Cvs Giy Ser Glu A.^n His Ser Ala Ala Tyr Arq 

5 10 15 ' 

1 Aso 'oln Gi-.' Vai Leu Asn Asn Glv Cvs Phe Val Aso A^a Lou Asn 

20 25 " 30 
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Vai 


Vai 


Pro 


His 


Vai 


Phe 


Leu 


Leu 


Phe 


JL 1 e 


Thr 


Pne 


Pro 


He 


Leu 


r'ne 






35 










4 0 










45 








lie 


Gi V 


'I'rp 


Gly 


Ser 


Gin 


Ser 


Ser 


Lys 


Vai 


His 


lie 


His 


His 


Ser 


Thr 




50^ 










55 










60 










Tro 


Leu 


His 


Phe 


Pro 


Gly 


His 


Asn 


Leu 


Arq 


Tro 


He 


Leu 


Thr 


Phe 


Met 


65 










70 










75" 










80 


Leu 


Leu 


Phe 


Vai 


Leu 


Vol 


Cys 


Glu 


He 


Ala 


Giu 


Gly 


He 


Leu 


Ser 


Asp 










85 










90 










95 


Gly 


Vai 


Thr 


Giu 


Ser 


His 


His 


Leu 


His 


Leu 


Tyr 


Met 


Pro 


Ala 


Gly 


Met 








100 










105 










HC 




Aia 


Prie 


Met 


Ala 


Ala 


v' a 1 


Thr 


Ser 


Vai 


V al 


Tyr 


Tyr 


His 


Asn 


He 


Glu 






115 










120 










125 








Thr 


Ser 


A sr. 


Pne 


Pro 


L y s 


Leu 


Leu 


He 


Ala 


Leu 


Leu 


Vai 


Tyr 


T rp 


Thr 




130 










135 










140 










Leu 


Ala 


Pne 


lie 


Thr 


L V s 


Thr 


I le 


Lys 


Phe 


V a 1 


Lys 


Phe 


Leu 


Asp 


His 


145 










150 










155 








160 
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lie 


G 1 '/ 


Pne 


Ser 


Gin 


Leu 
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Cys 


Leu 


T h r 


Gly 


Leu 


Leu 


Vai 
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Civ 


Met 
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Leu 
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1 1 1^ 


Arg 


Vai 
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F' r, e 
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A. r d 


G 1 u 
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Pro 


Pro 


Glu 






195 










200 
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Asp 


Leu 


G 1 n 


Asp 


l.eu 


G 1 y 


Va 1 
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Leu 


Gin 


V r 0 
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V J ■ 


Asn 


Leu 




2 










215 










220 










Leu 


Sor 


Lys 


Gly 


Thr 




Trp 


Trp 


Met 


Asn 


Ala 


Phe 


lie 


Lys 


Th.r 
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225 










230 










235 










24 0 


His 


Lys 


Lys 


Pro 


Xaa 


Asp 


Le u 


Arg 
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He 


Gly 
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Leu 


Pro 


He 


Ala 










245 










250 










255 




Met 


Arg 


Aia 


Leu 


Thr 


Asn 


T y r 


Gin 


Arg 


Leu 


Cys 


Glu 


Ala 


Phe 


Asp 


Aia 








260 










265 










270 






Gin 


Arg 


Lvs 


Asp 


He 


Gin 


Gly 


Thr 


Gin 




Aia 


Arq 


Aia 


He 


Trp 


Gin 






275 










280 








285 








Ala 


Leu 


Ser 


His 
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Phe 


Gly 


Arg 
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Leu 
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Leu 


Ser 


Ser 


Thr 


Phe 




290 
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He 


I.eu 


Ala 


Asp 


Leu 


Le u 
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Tro Giu Asn lie Phe 
515 

Me:: Thr Set Leu Arc 
530 

Met Asn Thr Aia lie 
545 

His Vai Ser Phe Phe 
565 

Aia Ser Leu Ser Leu 
580 

Ser Ser Vai Vai Arg 
595 

Leu Ser G-u Phe Leu 
610 

Pro His Giu Pro Thr 
625 

Pro Leu Arq Vai Vai 

Giv Leu Thr Giv Pro 
660 

A- a ASD Asn Cvs Cvs 
675 

Pro AsD Giy lie Pro 
690 

Giy Gin Leu Thr Met 
705 

Leu Leu Leu Aia Aia 
725 

Phe TrD 5er Ser Ser 
740 

Pro Giu Arg Giu Thr 
755 

Vai Aia Tvr Aia Ser 
77C 

Giu Asn lie lie Phe 
785 

Vai lie Giu Aia Cys 
805 

Gly AsD Gin Thr Gin 
82C 

Gin Arg Gin Arc lie 
835 

Vai Vai Pne Leu Asp 
850 

Asc His Leu Me^ Gin 
865 

Ara Thr Vai Vai Leu 
885 

Asp Trp Lie lie Aia 
900 

Ser Arg Thr Ser Ara 
915 

Pro His Giu Pre Thr 
930 

Arg Lys Aia Thr Giu 
9^5 

Arg AsD Giy Leu Leu 
965 

Giu Asn Giu Gi- Asp 
980 



Arg Tnr Arg Vai Giu Thr 
520 

Aia Pne Aia lie Tvr Thr 
535 

Pro lie Aia Aia Vai Leu 
550 555 
Lvs G^u Aia Asp Phe Ser 
570 

Phe His lie Leu Vai Thr 
585 

Ser Thr Vai Lys Aia Leu 
600 

Ser Ser Aia Giu lie Arg 
615 

Pro Gin Giv Pro Aia Ser 
630 ' 635 

Asn Arg Lys Arg Pro Ala 
650 

Leu Gin Ser Leu Vai Pro 
665 

Z^:. i'^o Met Giy Giv 
680 

Thr Leu Scr Asn lie Thr 
695 

lie Vai Giv Gin Vai Giy 
710 ' 715 

Leu Giv Giu Met Gin Lvs 
730 

Leu Pro Asp Ser Giu lie 
745 

Aia Thr Asd Leu Asp lie 
760 

Gin Lvs Pro Trp Leu Leu 
775 

Giu Ser Pro Phe Asn Lys 
"790 795 
Ser Leu Gin Pro Asd lie 
810 

lie G-v Giu Arg Giv lie 
825 

Ser Vai Aia Arg Aia Leu 
QAO 

Asd Pro Phe Ser Aia Leu 
855 

Aia Gly lie Leu Giu Leu 
870 875 
Vai Thr Kis Lvs Leu Gin 
890 

Met. L-/S Asd Giy Thr i^e 
905 

Giy Leu Asn Aia Ser Ser 
920 

G.y Pro Aro Aia Giy Giu 
935 

Pro Pro Arg Aia T'/r Leu 
950 ' 955 

Gin Aso Giu Giu Giu Giu 
970 

Asd T y r Leu Ser Ser Met 
985 



Thr 


Aro 


Arg 


Lys 


Giu 




525 








Ser 


lie 


Ser 


I ie 


Phe 


540 










lie 




Phe 


Va 1 


Giv 










560 


Pro 


Ser 


Vai 


Aia 


Phe 








575 




Pro 


Leu 


Pne 


Leu 


Leu 






590 






Vai 


Ser 


Va i 


Gin 


Lys 




605 








Giu 


i u 


Gin 


Cys 


Aia 


620 










Lys 


Tyr 


Gin 


Aia 


Vai 










640 


Arg 


Giu 


Asp 


Cys 


Arg 








655 




Ser 


Aia 


Asp 


Giy 


Asp 






67 C 






Tyr 


Pne 




T rp 


Thr 




685 








lie 


Arg 


lie 


Pro 


Arg 


700 










Cys 


Giy 


Lys 


Ser 


Ser 










720 


Vai 


Ser 


Gly 


Aia 


Vai 








735 




Giy 


Giu 


Asd 


Pro 


Ser 






75G 






Arg 


Lys 


Arg 


Giy 


Pro 




765 








Asn 


Aia 


Thr 


Vai 


Giu 


780 










G 1 n 


Arg 


Tyr 


Lys 


Met 










800 


Asp 


lie 


Leu 


Pro 


His 








815 




A.sn 


Leu 


Ser 


Giy 


Giy 






830 






Tyr 


Gin 


His 


Ala 


Asn 




845 








Asd 


I ie 


His 


Leu 


Ser 


8 60 










Leu 


Arg 


Asp 


Asp 


Lys 










880 


Tyr 


Leu 


Pro 


His 


Aia 








895 




Gin 


hrq 


Giu 


Giy 


Thr 






910 






Leu 


Ser 


Thr 


Giy 


Arg 




925 










Asn 


S' ^ 


Thr 


Giu 


940 










Vai 




Cys 


Pro 


Ara 










960 


Giu 


Giu 


Giu 


Al a 


Aia 








975 




Leu 


His 


Gin 


Arg 


Aia 



990 
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Glu lie Pro Trp Arg Aia C'/s Xaa hvs Tyr Leu Ser Ser Ala Giy lie 

995 ' 1000 ' 1005 

Leu Lou i^eu Ser Leu Leu Val Pr.e Ser Gin Leu Leu Lys n^s Met ';al 

lOlO 1015 1020 

Leu Val Ala lie Asd Tyr Tro Leu Ala Lvs Trc T.^.r Asd Ser Aia Leu 

025 " 1030 ' ' 1035 ' 1040 

Thr Leu Tnr Pro Aia Thr Ara A.sr. Cys Ser Leu Asn Gin Glu Cys Tr.r 

1045 ^ 1050 1055 

Leu Asn G.n Thr Val Tyr Ala Leu Val Phe Thr Vai Leu Gys Ser Leu 

1060 1065 1070 

Giy He Vai Leu Cys Leu Val Tr.r Ser Vai Thr Vr.l Glu Trp Thr Gly 

1C75 1080 iCS5 



Leu 


L V s 




Aia 


Lys Arg 


Le u 


I*. 1 s 


A r a 


Ser 


Le u 




.'VS., 




lie 




1090 






1095 








HOC 










Leu 


Aia 


r ro 


Met: 


Arg Phe 


Pr.e 


' 


Thr 


T n r 


r 0 


Leu 


o i y 


i-"" c r 


lie 




105 








i 1 1 0 








1115 










i _ .i; I-' 


Asn 


Arg 


ne 


Ser 


Ser Asp 


Cys 


s r. 




lie 


Asp 


Gin 


His 


lie 


Pro 


Ser 










1125 






1130 










1 1 JZi 




Thr 


Leu 


Glu 




Leu Sor 


Arg 




T h r 


Leu 


-_.t;U 


C y s 


, , ■] 




M X d 


i^eu 






114 0 








114 5 
















Ala 


Veil 


- 1 e 




y r Vai 


' •"' 




V'' a 1 


P T'. e 


Leu 


V a 1 


/ii a 


„ c u 


Lj r, ^ 


_ 






.155 








* 1 6 C 










i 1 0 








Leu 


Ala 


I * e 


Val 


Cys Tyr 






'j J. r; 


1. y s 


Tyr 


t\ ie 


i,' g 




Aia 




1170 








1 1 










1 1 8 C 










Arq 


Asp 


Leu 


Gin 


Gin Leu 


Asp 




Thr 


1' n r 


G i n 


L e u 


n y- — , 


^ e u 


^eu 


i e r 


185 








1 190 










1195 










i uu 


His 


Phe 


Ala 


Glu 


Thr Vai 


Glu 


^ - 


Leu 


Thr 


Tnr 


He 


f\ r g 


.'■-i X a 


Ph6 


Arg 










1205 








X Z lU 










X *c 1 O 




Tyr 


Glu 


A i a 


Arg 


Phe Gin 


(o 1, n 


« y 3 


Leu 


i-je u 


G iu 


T y r 


Thr 


Asc 


S c r 


A. s n 






1220 








X ^ ^ ~j 










J, J \J 






Asn 


lie 


Aia 


Ser 


Leu rne 


i.e u 




Aia 


Ala 


Asn 


Arg 




Leu 


G i u 


Val 






.235 








J. c. H 'J 










12 4 5 








Arg 


Met 


Glu 


Tyr 


1 X e <j I J 


Zi ) A 


y i) 


Val 


Vai 


Leu 


T 1 a 

J. J- „ 


Al a 


Ala 


Va i 


Thr 




1250 








1 O c r 

1 jL. ^ -) 










i o u 










Ser 


lie 




Asr. 


Ser Leu 


His 


r "3 


Glu 


Leu 


Ser 


/ \ 1 a 




■ ■ , 


V a i 


Giv 


265 








1270 










1275 










1280 


Leu 


G 1 y 


Leu 


Thr 


Tyr Aia 




Ke ^ 


a 1 


Ser 


Asn 






Asn 


I'ro 


Met 










1285 








1290 










12 95 




Val 


Arq 


As n 


Le u 


Ala Asp 


Met 


J u 


Lou 


Gin 


Leu 




A J. J 




Lys 


Arg 








130C 








1305 
















lie 


Hi3 




Leu 


Leu Lys 






.-^.1 a 


G 1 u 


Ser 








Leu 


Leu 






1315 








1320 










1325 








Aia 


Pro 


Ser 


Leu 


lie Pro 


Lys 


s n 


Trp 


Pro 


Asp 


G Ir; 






He 


G In 




1330 








1335 










1340 










lie 


Gin 


Asn 


Leu 


Ser Val 


Arg 


T y r 


s p 


Ser 


Ser 


Leu 


Lys 


Pro 


Vai 


Leu 


345 








1350 










1355 










1360 


Lys 


His 


Vai 


Asn 


Aia Leu 




5e r 


Pre 


Gly 


Gin 




I le 


■■J 1 y 


He 


Cys 










1365 








1370 










1375 




G i y 


Arg 




G 1 


Ser Gly 


Lys 




Ser 


Phe 


S r: t 


Leu 


Aj^ a 




phe 


Arg 








1380 








1385 










13 90 






Me:: 




Aso 


Tr.r 


Phe Glu 






1 1- 


I io 


I i e 


Asd 


G 1 V 




Asp 


He 






1395 








14 GO 










1405 








Aia 


Lvs 


Leu 


Pro 


Leu Hrs 






/\ r a 


S e r 


Arg 


L e u 


Ser 


: ie 


He 


Leu 




14 10 








14 15 










14 20 










G -L r. 


Asp 


1 ro 




Leu Phe 






Thr 


I i.e 


Arg 


Pne 


HS n 




Asp 


Pro 


425 








143C 










1435 










1440 


G 1 u 


Arg 


Lys 




Ser Asc 






Le u 


1 rD 


Glu 




Leu 


o - U 


lie 


Aid 










1445 








:4 50 










1455 




'J — p. 


Leu 




Leu 


V u 1 V 1 


\' s 






t : ' ' 






Leu 


As o 


Ala 


: le 



14 60 14 6:* 1 4 70 
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lie Th r G i u G 1 ■/ G 1 y G 1 u A-S n Phe S e r Gin G I y Gin Arcs Gin Le u Phe 

1475 148 0 1'185 

Cys Leu hla Ara Ala Phe Val Arq Lvs Thr Ser lie Phe lie Met Asc 

1490 1495 1500 

Glu Ala Thr A_= Ser lie Aso Met Ala Thr Glu Asn He Lc'j Gin Lvs 

505 1510 1515 1520 

Val Val Xet Thr Ala Pne Ala Asp Arq Thr Val Val Thr He Ala His 

1525 1530 1535 

Arg Val His Tr.r lie Leu 5c r Ala Aso Leu Vai lie Vai Leu Lvs Arq 

1540 1545 1550 

Giv Ala lie Leu Glu Pne Aso Lvs Pro Glu Lvs Leu Leu Ger Arq Lvs 

1555 1560 1565 

ASD Ser Vai Phe Ala Ser Pne Vai Arg Ala Aso Lys 

1570 157 5 ' " 1580 

{2:; :::roRMAT:oN for seo i:^ m:2: 

SEQGZNCE CHARACTERISTICS: 
(A) LI::GTH: 4931 oase pairs 
■:B} TYPE: nucleic acid 
■ C; :^T?A.NDEDMES3 : r.^r.ql^ 
-.u; TGPOLOGV; linear 

(11) molecule; TYPL: cDNA 

(vii} :m:-^£diate source: 

(A) LIBRARY: 

(B; CLONE: Consensus 

(xi) SEQ'JENCE DESCRIPTION: SEQ ID N0:2: 

GAAT7CCCGG GTCGACCCAC GCGTCCGCCG CGCCCGCGCC GZCATGCCCC TGGCCTTCTG 60 

CGGCAGCGAG AACCACTCGG CCGCCTACCG GGTGGACCAG GGGGTCCTCA ACA/\CGGCTG 12 0 

CTTTGTGGAC GCCCTCAACG TGGTGCCGCA CGTCTTCCTA CTCTTCATCA CCTTCCCCAT 180 

CCTCTTCATT GGATGGGGAA CTCAGAGCTC CAAGGTGCAC ATCCACCACA GCACATGGCT 24 0 

TCATTTCCCT GGGCACAACC TGCGGTGGAT CCTGACCTTC ATGCTGCTCT TCGTCCTGGT 300 

GTGTGAGATT GCAGAGGGCA TCCTGTCTGA TGGGGTGACC GAATCCCACC ATCTGCACCT 360 

GTACATGCCA GCCGGGATGG CGTTCATGGC TGCTGTCACC TCCGTGGTCT ACTATCACAA 42C 

CATCGAGACT TCCP-ACTTCC CCAJ^GCTGCT AA.TTGCCCTG CTGGTGTATT GGACCCTGGC 4 80 

CTTCATCACC .A.AGACCATCA AGTTTGTCA.A GTTCTTGGAC CACGCCATCG GCTTCTCGCA 54 0 

QCTACGCTTC TGCCTCACAG GGCTGCTGGT GATCCTCTAT GGGATGCTGC TCCTCGTGGA 600 

GGTCAJITGTC ATCAGGGTGA GGAGATACAT CTTCTTC.A.AG ACACCGAGGG AGGTGAAGCC 660 

TCCCGAGGAC CTGCAAGACC TGGGGGTACG CTTCCTGCAG CCCTTCGTGA ATCTGCTGTC 72 0 

CAAAGGCACC TACTGGTGGA TG.AACGCCTT CATCAAGACT GCCCACAA^GA AGCCCRTCGA 78 0 

CTTGCGAGCC ATCGGGAAGC TGCCCATCGC CP.TGAGGGCC CTCACCAACT ACCAACGGCT 840 

CTGCGAGGCC TTTGACGCCC AGCGGAAGGA CATTCAGGGC ACTCAAGGTG CCCGGGCCAT 900 

CTGGCAGGCA CTCAGCCATG CCTTCGGGAG GCGCCTGaTC CTCAGCAGCA CTTTCCGCAT 960 

CTTGGCCGAC CTGCTGGGCT TCGCCGGGCC ACTGTGCATC TTTGGGATCG TGGACCACCT 1020 

TGGGAAGGAG AACGACGTCT TCCAGCCCA-A. GACACAJi.TTT CTCGGGGTTT ACTTTGTCTC 1080 

ATCCCAAGAG TTCCTTGCCA ATGCCTACGT CTTAGCTGTG CTTCTGTTCC TTCCCCTCCT 114 0 

ACTGCAJIlA.GG ACATTTCTGC A-AGCATCCTA CTATGTGG CC ATTG.AAACTG GAATTAACTT 1200 

GAGAGGAGCA ATACAGACCA AGATTTACA.A TAAAATTATG CACCTGTCCA CCTCCAACCT 12 60 

GTCCATGGGA GA_-ATGACTG CTGGACAGAT CTGYAATCTG GTTGCCATCC ACACCZ-ATCA 1320 

GCTCATGTGG TTTTTCTTCT TGTGCCCAJU^, CCTCTGGGCT ATGCCAGTAC AGATCATTGT 1380 

GGGTGTGATT CTCCTCTACT ACATACTCGG AGTCAGTGCC TT.^-^TTGGAG CAGCTGTCAT 14 40 

CATTCTACTG GCTCCTGTCC AGTACTTCGT GGCCACCAAG CTGTCTCAGG CCCAGCGGAG :500 

CACACTGGAG TATTCCAA.TG AGCGGCTGAA GCAGACCA^.C GAGPvTGCTCC GCGGCATC.AA 1560 

GCTGCTGAA.G CTGTACGCCT GGGAGAACAT CTTCCGCACG CGGGTGGAGA CGACCCGCAG 1620 

GAJ^GGAGATG ACCAGCCTCA GGGCCTTTGC CATCTATACC TCCATCTCCA TTTTCATGA,A 16S0 

CACGGCCATC CCCATTGCAG CTGTCCTCAT AJ^rTTTCGTG GGCCATGTCA GCTTCTTCAA 17 40 
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AGAGGCCG.^C TTCTCGCCCT CCGTGGCCTT TGCCTCCCTG 7CCCTCTTCC ATATCTTGG7 1800 

CACACCGCTG TTCCTGCTGT CCAGTGTGGT CCGATCTACC GTGAAAGCTG TAGTGAGGGT 13 60 

GCAAAAGCTA AGCGAGTTCC TGTCCAGTGC AGAGATCCGT GAGGAGGAGT GTGCGGGCCA 1920 

TGAGCCCAGA CCTCAGGGCC CAGCCAGCAA GTACCAGGCG GTGCCCCTCA GGGTTGTGAA. 198 0 

CCGCAAGCGT CCAGCCCGGG AGGATTGTCG GGGCCTCACC GGCCCACTGC AGAGGGTGGT 20 4 0 

CCCCAGTGGA GATGGCGATG CTGACAACTG CTGTGTCCAG ATCATGGGAG GCTAC7TCAC 2 100 

GTGGACCGCA GATGGAATCC CCACACTGTC CAACATCACC ATTGGTATCC CCCGAGGCCA 2160 

GCTGACTATG ATCGTGGGGC AGGTGGGCTG CGGCAAGTCC TCGCTCCTTC TAGCCGCACT 2 2 20 

GGGGGAGA7G CAGAAGGTCT CAGGGGCTGT CTTCTGGAGC AGCAGCCTTC CTGACAGCGA 22 8 0 

GATAGGAGAG GACCCCAGCC CAGAGCGGGA GACAGCGACC GACTTGGATA TCAGG-JVGAG 2 34 0 

AGGCCCCGTG GCCTATGCTT CGCAGAAACC ATGGCTGCTA AATGCCACTG TGGAGGAGAA 2 4 GO 

CATCATCT7T GAGAGTCCCT TCAACAAACA ACGGTACAAG ATGGTCATTG AAGCCTGCTC 2 4 60 

TCTGCAGCCA GACATCGACA TCCTGCCCCA TGGAGACCAG ACCCAGATTG GGGA^AGGGGG 2 520 

GATCAACCrG TCTGGTGGTC AACGCCAGCG AATCAGTGTG GGCCGAGCCC TCTACCAGCA 2 58 0 

CGCCAACGTT GTG7TCTTGG ATGACCCCTT C7CAGCTCTG GATATCCATC TGAG7GACCA 2 64 0 

CTTAATGGAG GCGGGCATCC 7TGAGCTGCT CCGGGACGAC AAGAGGACAG TGGTC7TAGT 2 7 00 

GACCCACAA.G CTACAGTACC 7GCCCCATGC AGACTGGATC A7TGCCATGA AGGA7GGCAC 2 7 60 

CATCCAGAGG GAGGGTACCT CAAGGAiCTTC CAGAGGTCTG AATGCCAGCT CTTTGAGCAG 2 8 20 

TGGAAGAGCT CATGA.ACCGA CAGGACCAAG AGCTGGAGAA GGAAATGTCA CAGAGAGAAA 2 8 80 

AGCCACAGAG CCACCCAGGG CCTATCTCGT GCCATGTCCT CGAAGGGATG GCCT77TGCA 2 94 0 

GGATGAGGAA GAGGAGGAAG AGGAGGCAGC TGAGAACGAG GAGGATGACT ACCTG7CGTC 30 00 

CATGCTGGAG CAGCGTGC7G AGATCCCATG GCGAGCCTGC ::CCAAGTACC TGTCG7CGGC 3060 

CGGCATCG7G CTGC7GTCG7 TGC7GGTC77 C7CACAGC7C C7CA.AGCACA TGGTC7TGG7 3120 

GGCCATCGAC TAC7GGCTGG CCAAGTGGAG CGACAGCGCC C7GACGCTGA GCCC7GCAAC 3180 

CAGGAAC7GC TCGC7CAACC AGGAGTGCAC CCTCAACCAG ACTGTCTATG CCTTGGTGTT 32 4 0 

CACGGTGC7C TGCAGCCTGG GCATTGTGC7 G7GCCTCGTC ACGTCTGTCA CTGTGGAGTG 3300 

GACAGGGC7G AAGG7GGCCA AGAGACTGCA CGGCAGCCTG CTAAACCGGA TCA7CGTAGG 3 3 60 

CCCCATGAGG TT777TGAGA CCACGCCCCT TGGGAGCATC CTGAACAGA7 TTTGA7CTGA 3 4 20 

CTGTAACAGC ATCGACCAGC ACATCCCATC CACGCTGGAG TGCCTGAGCC GCTCCACCCT 34 80 

GCTCTGTG7C TCAGCCCTGG CCGTCATCTC CTATGTCACA CCTGTGTTCC TCGTGGCCCT 35 4 0 

CTTGCCCCTG GCCA7CGTGT GCTACTTCAT CGAGAAGTAC TTCCGGGTGG CGTCCAGGGA 3 600 

CCTGCAGCAG CTGGATGACA CCACCCAGC7 TCCACTTCTC TCACACTTTG CCGAAJ^CCGT 3 660 

AGAAGGAC7C ACCACCATCC GGGCCTTCAG GTATGAGGCC CGGTTCCAGC AGAAGCTTCT 37 20 

CGAATACACA GACTCCAACA ACATTGCTTC CCTCTTCCTC ACAGCTGCCA ACAGA7GGCT 37 8 0 

GGAAGTCCGA ATGGAGTACA TCGGTGCATG TGTGGTGCTC A7CGCAGCGG TGACC7CCAT 38 4 0 

CTCCAAC77C CTGCACAGAG AGCTCTCTGC TGGCCTGGTG GGCCTGGGCC TTACCTACGC 3 900 

CCTAATGG7C TCCAACTACC TCAACTGGA7 GGTGAGGAAC C7GGCAGACA TGGAGCTCCA 3 960 

GCTGGGGG77 GTGA.AGCGCA TCCATGGGCT CCTGAAAP'.CC GAGGCAGAGA GCTAGGAGGG 4 0 20 

GCTCCTGGCA CCA7CGCTGA TCCCAAAGAA C7GGCCAGAC G-AAGGGA.AGA 7CCAGATCCA 4 0 80 

GAACC7GAGC GTGC3CTACG .ACAGCTCCC"' GAAGCCGG7G C7GAAGCAGG 7CAA73CCC7 4 140 

CA7CTCCC77 GGACAGAAGA * C'oGGATC7^^) CGGCCGCACC GGCAoTGGGn. Avj7'^C . CCT'. 4zOC 

C7C7CT7GCC T7C77CCGCA 7GGTGGAGAC G7TCGAAGGG CAGATGATCA TTGA7GGCA7 4 2 60 

7GACA7CGCC AAAC7GCCGC TGCACACGC7 GCGCTCt\CGC CTCTCCATCA TCCTGCAGGA 4 3 20 

CCCCGTCC7C TTCAGCGGCA CCATCCGAT7 TA.ACCTGGAC CC7GAGAGGA AGTGC7CAGA 4 3 80 

TAGCACAC7G TGGGAGGCCC TGGAAATCGC CCAGC7GAAG CTGGTGGTGA AGGCAGTGCC 4 4 40 

AGGAGGCC7C GA7GGCATCA TCACAGAAGG CGGGGAGAAT 77CAGCCAGG GACAGAGGCA 4 500 

GCTGTTC7GC CTGGCCCGGG CC7TCGTGAG GA.AGACCAGC ATCTTCATCA TGGACGAGGC 4 5 60 

CACGGCT77C ATTGACATGG CCACGGAAAA CATCC7CCAA AAGGTGGTGA TGACAGCCTT 4 620 

CGCAGACC3C ACTG7GGTCA CCATCGCGCA 7CGAGTGCAC ACCATCCTGA GTGCAGACCT 4 680 

GGTGATCG7G C7G-AAGCGGG GTGCCATCC7 7GAGTTCGAT AAGCCAGAGA AGCTGCTCAG 4 74 0 

CCGGAAGGAC AGCG7CTTCG CC7CCTTCG7 CCGTGCAGAC A^GTGACC7G CCAGAGCCCA 4 80C 

AGTGCCA7GC CACA7TCGGA CCCTGCCCAT ACCCCTGCCT GGG7TTTCTA ACTGT.AAATC 4 8 60 

AC77GTA.--A7 AAJ^.7AGAT77 GATTATTAA.A .AA.-AA.A,AAJV.A A.^„AAAAA,VvA AAAA^^^AAAAA. 4 92 0 

A.AAAAAA-A-AA A 4 931 

(2) :::rORMATION for SEQ id N0:3: 

ii; SEQ7i::CE CHARACTc:RIS7ICS ; 
;A) LE:-:GTH: ISSl amino ..^cids 
;3) TV?£: amino acid 
■C) 37PANDEDHESS : single 

4 6 
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{D) TOPOLOGY: linear 

XOLECULE TY?h:: peptide 

(vii' IMMEDIATE SOURCE: 
(A) LIBRARY : GenBank 
(B^ CLONE: 1369844 



(xil SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



Met 


?rc 


Leu 


Ax a 


Phe 


Cys 


Giy 


Ser 


Glu 


Asn 


His 


Ser 


Ala 


Aia 


Tyr 


Arg 










5 










10 










15 




Va i 


Asp 


Gin 


Gi V 


Vai 


Leu 


Asn 


Asn 


Gly 


Cys 


Pne 


Val 


Asp 


Vai 


Leu 


Asn 








20 










25 










30 






Val 


Val 


Pro 


ii i s 


Vai 


Phe 


Leu 


Leu 


Phe 


lie 


T h r 


Phe 


Pro 


lie 


Leu 


Phe 






35 










40 










45 








lie 


G 1 v 




G ^ 'v' 


Ser 


Gin 


Ser 


Ser 


Lys 


Val 


n 1 3 


I ie 


His 


His 


Ser 


T h r 




50 










55 










60 












Leu 


His 


Pne 


Pro 


Giy 


His 


Asn 


Leu 


Arg 


Trp 


lie 


Leu 


I n r 


Phe 


Met 


65' 










70 










7 5' 










30 


Lea 


i.eu 


Pne 




Leu 


Vai 


C y s 


Glu 


lie 


Ala 


Glu 


Giy 


lie 


Leu 


Ser 


Asp 










85 










90 










9 5 




G.y 


'/al 


Thr 


^ J. 


Ser 


liis 


His 


Leu 


His 


Leu 


7 y r 


Met 


Pro 


Aia 


G L. y 


Met 








ICO 










105 










110 






Ala 


P ne 


Met 


A 1 a 


Ala 


Vai 


Thr 


Ser 


Vai 


Vai 


T '/ r 


Tyr 


His 


Asn 


I ie 


Glu 






115 










120 










125 








Thr 


Ser 


Asn 


Phe 


Pro 


Lys 


Leu 


Leu 


I ie 


Aia 


Leu 


Leu 


Vai 


Tyr 


Trp 


Thr 




130 










135 










140 










Leu 


Al 6 


Phe 


i ie 


Thr 


Lvs 


Thr 


lie 


Lys 


Phe 


Vai 


Lys 


Phe 


Leu 


Asp 


His 


145 










150 










155 










160 


Ala 


lie 


A.la 


Phe 


Ser 


Gin 


Val 


Arg 


Phe 


Cys 


Leu 


Thr 


Gl y 


Leu 


Leu 


Vai 










165 










170 










175 




lie 


Leu 


Tyr 


Giy 


Met 


Leu 


Leu 


Leu 


Val 


Glu 


Val 


Asn 


Val 


lie 


Arg 


Val 








180 










185 










190 






Arg 


Arg 


Tyr 


lie 


Phe 


Phe 


Lys 


Thr- 


Pro 


Arg 


Liu 


Val 


Lys 


Pro 


Pro 


Giu 






195 










200 










205 








Asp 


Leu 


Gin 


Asp 


Leu 


Gly 


Val 


Arg 


Phe 


Leu 


■0 X Ti 


Pro 


Pne 


Va i 


Asn 


Leu 




210 










215 










220 










Leu 


Ser 


Lys 


G : ■/ 


Thr 


Tyr 


Trp 


Trp 


Met 


Asn 


A i a 


Pne 


lie 


L y s 


Thr 


Aia 


225 










2 30 










2 3 5 










240 


His 




Lys 




lie 


Asp 


Le u 


Arg 


Ala 


lie 




L ys 


Le u 






Aia 










245 










250 








2 55 




Met 


Arg 


Ala 


Leu 


Thr 


Asn 


Tyr 


Gin 


Arg 


Leu 


Cys 


Glu 


Aia 


Phe 


Asp 


Aia 








260 










265 










270 






Gin 


Vai 


Arg 


L y s 


Asp 


lie 


Gin 


Gly 


Thr 


Gin 


Giy 


Ala 


Ar a 


Aia 


He 


Trp 






275 










280 










285 








Gin 


A.ia 


Leu 




His 


Ala 


Phe 


Gly 


Arg 


Arg 


Leu 


Val 


Leu 


Ser 


Ser 


Thr 




290 










295 










300 










Phe 


Arg 


lie 


Leu 


Ala 


ASD 


Leu 


Leu 


G 1 y 


Phe 


Aia 


Gly 


Pro 


Leu 


Cys 


I ie 


305 










310 










315 










320 


Phe 


Gly 


lie 


Vai 


ASD 


His 


Leu 


Gly 


Lys 


Glu 


Asn 


Asp 


1 7 ~ ■> 


Phe 


Gin 


Pro 










325 










330 










335 




Lys 


T.^r 


Gin 




Leu 


vjj * y 


Val 


Tyr 


Phe 


Vai 


i;e r 


Ser 


Gin 


■3 i U 


Ph.e 


Leu 








340 










345 










350 






Ala 


Asn 


Ai a 




Vai 


Leu 


Ala 


Val 


Leu 


Leu 


Phe 


Leu 


Ala 


Leu 


Leu 


Leu 






355 










360 










365 








Gin 


Ara 


Thr 




Leu 


Gin 


Aia 


Ser 


Tyr 


T . - 

* / * 


Val 


Ala 


i i 


Giu 


Tnr 


Gly 




370 










375 










380 










lie 


Asn 


Leu 


•*--- g 


Giy 


Aia 


lie 


Gin 




" / 




i'yr 


Asn 




lie 


Met 


385 










3 90 










395 










400 
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His Leu Ser Thr Ser Asn Leu Ser Met Gly Glu Met Thr Giy Gin 

405 ^'.10 415 

lie Cys Asn Leu Val A. 3 lie Asp Thr v.sn Gin Lt^u Me: 'Ire Pne Phe 

420 425 430 

Phe Leu Cvs Pro Asn Leu Trp Aia Met Pro Vai Gin lie Il^i Val Giy 

435 440 445 

Val lie Leu Leu Tyr T'/r lie Leu Giy Vai Ser Aia Leu lie Giy Ala 

4 50 455 460 

Aia Val lie lie Leu Leu Aia Pro Vai Gin Tyr Phe Vai Ala Thr Lys 
465 470 475 480 

Leu Ser Gin Ala Gin Ara Thr Thr Leu Glu Tyr Sor Asn Arg Leu 

485 490 " 495 

Lys Gin Thr Asn Giu Ket Leu Arq Giy lie Lys Leu Leu l,ys Leu Tyr 

500 505 51C 

Aia Tro Giu Asn lie Pne Arq Thr Arq Vai Gia Thr Thr Ara Arg Lys 

515 520 ' 525 

Giu Met Tnr Ser Leu Ara Aia P'ne Aia lie Tyr Thr Ser lie Ser lie 

530 535 • 540 

Pne Met A^n Thr Ala lie Fro Ij.e Aia Ala Vai Lou lie Tor Phe Val 
543 550 555 560 

G^y His '/ai £er Pne Pn^ Lys G^u A^a A£p Pne Ser I'ro ior Val Aia 

56S ^ 570 575 

Pne Aia Ser Leu oor Leu i'he j-us lie Leu V^il Thr Pro Lou Phe Leu 

580 585 590 

Leu Ser Ser Val Vai Arg Ser Thr Val Lys Aia Lou Vai Ser Vai Gin 

595 600 605 

Lvs Leu Ser Glu Phe Leu Scr Sor Aia Giu ii.o Arc Giu Jj.j Gin Cvs 

610 615 620 

Aia Pro His Glu Pro Thr Pro Gin Giy Pro Ala Ser Lys Tyr Gin Aia 
625 630 635 640 

Val Pro Leu A.ro Vai Veil Asn Arg Lys Arg Pro Aia Aro .Jij Asp Cys 

645 " 650 ' 655 

Arq Glv Leu Thr Glv Pro Leu Gin Ser Leu Vai Iro Ser A",a Asp Giy 

660 ^ 665 b^C: 

ASD Aia ASD Asr. Cys C'/s Val Gin lie Met Giy Giy Tyt ro" Z':\r Trp 

675 680 685 

Thr Pro Asd Gi-/ lie Pro Thr Leu Ser Asn lie Tnr lie /-rT lie Pro 

690 " ' 695 700 

Arq Glv Gin ^eu Tnr Met lir- Val Giy Gin Val G'.'s "1/ Ia's Ser 

7C5 710 715 720 

Ser Leu Leu Leu Aia A. la Leu Gl\- Glu Met Glri '^-vS Val ^or Giy Aia 

725 730 ' 735 

Val Phe Trp Ser Ser Leu Pro Aso Ser Giu lie G^v GiU Aso Pro Ser 

740 . ' 745 ' 750 

Pro Glu Arg Giu Thr A. a Thr Asd Leu Asp lie Arq Lys .^.rc: Giy Pro 

755 760 765 

Vai Ala Tyr Ala Ser Gin Lvs Pro Trio Leu Leu Asn Aia Thr Vai Giu 

770 775 780 

Glu Asn lie lie Phe Glu Ser Pro Phe Asn Lys Gin Arg Tyr Lys Met 
785 790 ^95 800 

Val lie Giu Aia Cys Ser Leu Gin Pro Asp lie Asp lie l.eu Pro His 

805 810 ' 815 

Giy Asp Gin Thr Gin Il.e Giy Gia Ara Giy lie Asn Leu Se:- Giy Giy 

320 ^ 825 930 

Gin Arg Gin Ar:; lie Sor Vai Al^a Arg Aia Leu Tyr Gin ^-.^s Aia Asn 

835 ' 840 845 

Vai Val Pne Leu A.sc A.so Pro Phe Ser A.ia Leu Aso lie :;:s Leu Ser 

850 ' ' 855 860 

A.5p H:.s L.eu Met Gin Aia GJ y lie Leu Giu Leu L*^-u Aro Asp Asp Lys 
865 870 875 ' 880 
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Arg 


Thr 


i 


Vai 


Leu 
835 


Vai 


Thr 


His 


Lys 


Leu 
890 


Gin 


Tyr 


Leu 


P r o 


His 
895 


Aia 


Asp 


Trp 


He 


lie 
900 


Ala 


Met 


Lys 


Asp 


Gi V 
905 


Thr 


lie 


G i n 


Arg 


Glu 
910 


Giy 


Tnr 


L^u 


Lys 


."SO 

515 


Pne 


Gin 


Arg 


Ser 


G : u 
920 


Cys 


G 1 n 


Leu 


Phe 


Giu 
925 


f! 1 s 


Trp 


Lys 


Tl-.r 


Leu 
930 




A^r. 


Arg 


Gin 


A.SD 

935 


Gin 


Giu 


Leu 


Giu 


Lys 
940 


Giu 


Thr 


Vai 


Thr 


o — . 


Arg 


Lys 


Aia 


Thr 


g:.u 


Pro 


Pro 


Gin 


Giy 


Leu 


Ser 


Arg 


Aia 


Met 


Ser 


945 










950 










955 










960 


Ser 


Ara 


Asp 


Gly 


Leu 
965 


Leu 


G 1 


Asp 


Giu 


Giu 
970 


Giu 


Giu 


Giu 


Glu 


Giu 
975 


Aia 


k.a 


Giu 


5er 


G i u 
98C 


Glu 


Asp 


Asp 


Asn 


Leu 
985 


Ser 


Ser 


Mcz 


Leu 


His 
990 


Gin 


Arc 


A. a 


Glu 


: 1 0 


?rz 


Trp 


Ar g 


Ala 


Cvs 


Aia 


Lys 


Tyr 


Leu 


Ser 


Ser 


Aia 


Giy 






595 










lOCC 










1005 






lie 


Leu 
lOlC 


Lou 


Leu 


Ser 


Leu 


Leu 
1015 


Vczi 


Phe 


Ser 


Gin 


Leu 
1020 


Leu 


Lys 


His 


Met 


Va^ 


Leu 


.'ii 


Ax^ 


lie 


A3T0 


y r 


Trp 


Leu 


A.ia 


LVs 


Trp 


Thr 


Asp 


Ser 


Aia 


025 










1030 










1035 








1040 






_ r u 




Pro 


Ala 


A i a 


Ara 


Asn 


Cys 


Ser 


Leu 


Ser 


Gin 


Giu 


Cys 










:o*i5 










.050 








1055 






Lg J 


.-■sp^ 


^ r. 


Thr 


Vai 


i •,' r 


A^a 


Met: 


Vui 


Pne 


Thr 


Ax a 


Vai 


Cys 


Ser 








oe: 










1065 








1070 






Le J 


Giy^ 


; _e 


Va 1 


Leu 


Cys 


Leu 


Veil 


Thr 


Ser 


Vai 


Thr 


Vai 


Giu 


Trp 


Thr 






075 










lOSO 








1085 






Gly 


Leu 


Lys 


Vai 


Ala 


Lys 


Arg 


Leu 


His 


Arg 


Ser 


Leu 


Leu 


Asn 


Arg 


lie 


1090 










1095 






1100 










He 


Leu 


Ala 


Pro 


Met 


Arg 


Phe 


Phe 


Gi u 


Thr 


Thr 


Pro 


Leu 


Giy 


Ser 


lie 


105 








1110 








1115 








1120 


Le J 


Asn 


;vrq 


Phe 


Ser 


Ser 


Asp 


Cys 


Asn 


Thr 


lie 


Asp 


Gin 


His 


lie 


Pro 










_125 








1130 






1135 








Leu 


Gi ^ 


Cys 


Leu 




Ar g 


Ser 


Thr 


Leu 


Leu 


Cys 


Vai 


Ser 


Aia 






114: 










1145 








L150 






Le'J 


Ai3 


Vai 


Tie 


Ser 


Tyr 


Vai 


Thr 


Pro 


Vai 


Phe 


Leu 


Vai 


Aia 


Leu 


Leu 






155 








1160 








1165 








Fro 




.-Lia 


lie 


Vai 


Cys 


Tyr 


Pne 


He 


Gin 


Lys 


Tyr 


Pne 


Arg 


Vai 


Aia 




1170 








1175 








1180 










Ser 


Ara 




L6 u 


Gin 


Gir. 


Leu 


As r. 


Asc 


Thr 


Thr 


Gin 


Leu 


Pro 


Leu 


Leu 


1 c 3 








1190 








1195 










20 0 


Ser 


His 


rne 


.-^^ _ =: 


G i u 


Tnr 


Vai 


G i u 


' J _ " ,' 


i^e u 


Tnr 


Thr 


lie 


Arg 


Ala 


Pne 








1205 










210 








1215 




Arg 


Tyr 


G ^ u 


Aii 


Arg 


Phe 


Gin 


Gin 


Lys 


Leu 


Leu 


Giu 


Tyr 


Thr 


Asp 


Ser 






1 


22''; 








1225 








1230 






Asn 


Asn 


lie 


Ala 


Ser 


Leu 


Pne 


Leu 


Thr 


Aia 


Aia 


Asn 


Arg 


Trp 


Leu 


Gi u 






2 35 








1240 








1245 








Vai 


Arg 


^^et 


Gi 


Tyr 


lie 


Giy 


Aia 


Cys 


Vai 


Vdl 


Leu 


lie 


Aia 


Aia 


Vai 



1250 1255 1260 



Tnr Ser lie Ser Asn Ser Leu His Ara Giu Leu Ser Aia Giy Leu Vai 

265 1270 1275 1280 

Giy Leu 2iy LeU Thr Tyr Aia Leu Met '.'ai Ser Asn Tvr Leu Asn Trp 

1285 1290 ^ 1295 

Met Vai Arq Asn Leu Aia Asp Met Giu Leu Gin Leu Giv Aia Vai Lys 

'l30: ^ 1305 ^1310 

.Arg lie his Gi', Leu Leu Lys Thr Giu Aia Giu Ser Tvr G*u Giy Leu 

1315 1320 1325 

Leu Aia Pro Ser Leu lie Pro Lvs Asn Trp Pre Asd Gin Giv Lvs lie 

1330 1335 " ' 1340 

Gin lie iin Asn Leu Sor '/ai Ara T\-r Asd Ser Ser Leu Lvs Pro Vai 

34 5 1350 ' ' 1355 ' 1360 
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Leu Lys His Vai Asn Aia Leu lie Ser Pro G 1 y G:n l.vs lie Giy lie 

1365 1370 ' i37S 

Cys Giy Arg Thr Giy Ser G^y Lys Sor Sor I^ne Ser Leu Ali Phe Phe 

1380 ^ 1385 1390 

Arg Met Vil Asd Thr ?n.e Giu Giy His lie lie lie Asc Glv lie Asp 

1395 ' 1400 1405 

lie Aia Lyr^ Leu Pro Leu His Thr Leu Ara Ser Arg Leu Ser lie lie 

1410 1415 ' 1420 

Leu Gin Asd Pro Vai Leu Phe Ser Giy Thr :ie Arq Pne Asn Leu Asp 

425 ' 1430 1435 1440 

Pro Giu Arq Lys Cys Ser Asd Ser Thr Leu Trn Giu Aia Leu Giu lie 

1445 ' 1450 " 1455 

Aia Gin Leu Lys Leu Vai \'ai Lys Aid Leu Pro Giy Giy Leu Asp Aia 

1460 ^ 1465 ' 1470 

lie lie Thr Giu Giy Giy Giu Asn Phe Ser Gir. Giy Gin Arc Gin Leu 

1475 1480 1485 

Phe Cvs Leu Aia Arq Aia Phe Vai Arg Lys Thr Ser Tie Pne li. e Met 

1490 ' 1495 " 1500 

Asd Giu Aia Thr Aia Ser lie Asd Met Aia Tnr Giu Asn lie Leu Gin 

505 1510 ' 1515 1520 

Lys Vai Vai Met Thr A*a Pne Aia Asp Arc: Thr V'c; 1. v/. : Inr lie Aia 

1525 1530 1535 

His Arq Vai His Thr lie Leu Ser Aia Asp l.ou Vai T 1 Vu^ :-ou Lvs 

1540 1545 1550 

Arq Glv Aia lie Leu Giu Phe Asp Lvs Pro Giu x-vs Leu ^eu Ser Arg 

^ 1555 1560 ^ ' 1565 

Lvs Asd Ser Vai Phe Aia Ser Phe Vai A r q A 1 a Asp Lvs 

1570 1575 1580 

(2; i:;FORMATION FOR SEQ ID NO: 4: 

(!) S^OUEKGE CHARACTERISTICS: 
(Ai LENGTH: 1581 amino acids 
{EJ TYPE: amino acid 
;C; STRANDEDNESS : single 
iD) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(vii. IMMEDIATE SOURCE: 
1 k LIB PARY : G e noan k 
[B] CLONE: 1311534 

(xi: 3E0"ENCE OESCR I PT lOtJ : SEQ ID NO : 4 : 

Met Pro Leu Aia Phe Cys Giy Thr Giu Asn fiis Ser Aia Aia Tyr Arq 

5 10 15 ' 

^' a 1 Asd Gin G i '/ Vai Leu Asn Asn Giy c; v s Pne / a 1 A s c Aia Leu Asn 

20 ' 25 ' ' 30 

Vai Vai Pro His Vai Phe Leu Leu Phe He Tnr the Pr" lie Leu Phe 

3 5 4 0 4 5 

lie Glv : r o G 1 •,■ Ser Gin Ser Ser Lys Vai his I ^ e His p; i s Ser Thr 

50' " ' 55 ^ 60 

Tro Leu His Pne Pro Glv His Asn Leu Arg 1 ro li.e Leu I'hr. Pne lie 

65" *7o' 75 SO 

Leu Leu Phe Vai Leu Vai Cys Giu lie Ala Giu Giy lie Lou Ser Asd 

85 90 95 

G 1 V \'al I n r Giu Ser r g His Le u His Leu ~ v r Me t Pre Ai a G 1 v Me t 

100 105 ' 110 



50 
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Ala Pne 1':^-: A^-:. Aia Ij.e T.^.r Ser Vai Vai Tvr Tvr iiis hsr. lie Glu 

115 120 125 

Tr.r Ser Asn ?r.e Pro L'/s Leu Leu lie Aia Leu Leu lie Tvr Tro Tr. r 

130 ' 135 140 

Leu Aia the lie Thr Lvs Thr lie Lvs Phe Val. Lvs Phe Tyr Asp His 

145 150 155 160 

A^a lie Gly Pne Ser Gin Leu Arg Phe Cvs Leu Thr Gly Leu Leu \'al 

165 170 175 

lie Leu Tvr Glv Met Leu Leu Leu Val Giu Val Asn Vai lie Arq Val 

160 185 19C 

Arc: A.rq Tvr '/al Phe Phe Lvs Thr Pro Arg G-u Val Lvs Pro Pro G*u 

195 200 205 

ASD Leu Gin Aso Leu Glv Vai Ara Phe Leu Gin Pro Phe Vai A.sn Leu 

210 " ' 215 ^ 220 

Leu Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala Pne lie Lys Thr Aia 

225 230 * * 235 240 

His Lvs Lys Pro lie AsD Leu Ara Aia lie Giv Lvs Leu Pro lie Aia 

245 ' ' 250 ' ' 255 

Met .Arg Aia Leu Thr Asn Tyr Glr* Arg Leu Cys Leu Aia Ph^ A.sp Aia 

260 265 270 

Gir. Aia Ara L'.-s Asd '\:\r 'Zlr. Ser Gin Gin Glv Aia Ara Aii lie Tro 

275 " 280 ' 285 

Arg Aia :.eu C'.'s His Aia Phe Giv Ara Ara Leu Vai Leu Ser Ser Tr.r 

290 ' 295 " 300 

Phe Ara lie Lsu Aia Asd Leu Leu Gly Phe Aia Gly Pro Leu Cys He 

305 310 315 320 

Phe Gly lie Veil Asp His Leu Gly Lys Giu Asn His Vai Phe Gin Pro 

325 330 335 

Lvs Thr Gin Phe Leu Gly Val Tyr Phe Val Ser Ser Gin Giu Phe Leu 

340 345 350 

Giv Asn Aia T'/r Val Leu Ala Vai Leu Leu Pne Leu Aia Leu Leu Leu 

355 360 365 

Gin A.rg Thr Pre Leu Gin A.ia Ser Tvr Tyr Vai Ala lie Giu Thr Gly 

370 375 " ' 380 

lie Asn Leu Ara Gly AI.a lie Gin Thr Lvs lie T-/r Asn Lys lie Met 

385 ' 390 ^ 395 ' ' 400 

His Leu Ser Tr.r Ser Asn Leu Ser Met Giv Giu Met Thr Aia Gly Gin 

405 410 415 

lie Cys Asn „ej Vai Aia lie Asc Thr Asn Gin Leu Met Trp Phe Pne 

4 20 ' 4 25 4 3C 

Pne Leu Cys Pro Asn Leu Trp Aia Met Pro Val Gin Tie I _ e Val Gly 

435 440 445 

Vai lie Leu Leu Tyr Tyr lie Leu Gly Vai Ser Aia Leu lie Gly Ala 

450 455 ' 460 

Ala Vai Tie T„e Leu Leu A.ia Pro Vai Gin Tvr Phe Vai Ala Thr Lys 

465 470 475 430 

Leu Ser Gin A_a Gin Ara Thr Thr Leu Giu T\*r Ser Asn G^^u Arg Leu 

485 490 ^ 495 

Lvs Gin Thr Asn Giu Met Leu Ara Gly lie Lvs Leu Leu Lvs Leu Tyr 

500 505 ' SIC 

Aia Tro Giu A.sn lie Phe Cvs Ser Arq Vai Giu Lvs Thr Ara Ara Lys 

515 ^ 520 ' ^ 525 ' 

G_j Met Tnr Ser Leu Arq Ala Phe Ala Vai Tvr Tnr Ser l.e Ser lie 

530 ' 535 ' 540 

Phe Met Asn Thr Aia He Pro lie Aia Ala Val Leu lie Thr Pne Val 

545 550 555 560 

Giv His '.'al Ser Phe Phe Lvs Giu Ser Asp Phe Ser Pro Ser Vai Aia 

565 ^ 570 575 

Pne Aia Ser Leu Ser Leu Phe His lie Leu Val Thr Pro Leu Phe Leu 

580 5S5 590 
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Leu Ser Ser Wa'^ Vai Ara Ser Thr Vai Lvs Aia Leu Vai Ser Vdi Gin 

595 600 605 

Lys Leu Ser Ciu Phe Leu l^er Ser Aia G-u lie Arq Giu 31j Gin C'/s 

610 615 620 

Ala Pro Arg Giu Pro A* a Pre Gin Giy Gin Aia Glv Lys T\'r Gin r-.'ia 

625 630 635 ' ^ 640 

Vai Pro Leu Lys Vai Vai Asn Arq Lvs Arq Pro Ala Ara Glu Giu Vai 

645 " 650 ' 655 

Arg Asp Leu Leu Giy Pre Leu Gin Arq Leu Thr Pro Ser Thr Asp Giy 

660 665 670 

Asp Aia Asp AsTi Phe Cys Vai Glr. lie lie Giy Giy Pne Phe Thr Trp 

675 680 685 

T r. r P r c .As p Glv lie Pre 1" r Leu Ser As n lie Thr lie Ara lie ? r o 

690 ' 695 700 

Ara Giy Gin Leu Thr Met lie Vai Giy Gin Vai Giy Cvs G1-. Lvs Ser 

705 710 ' 715 ' ' ' 720 

Ser Leu Leu Leu Aia Thr ..,eu Glv Glu Mer GJ n Lys Vai Ser Giy Aia 

725 ' 730 ' 735 

Vai Phe T r p A s r. Ser Leu r' r o Asp Ser Giu Giy Glu A s c Pro Ser A s n 

740 ' 745 ' 750 

Pro Giu Arg Glu Thr Aia A.la .-^.sc Ser Aso Ala Arq Ser Ar::i Giy Pro 

755 "^bO 765 

Vai A.ia lye Ala Ser Gin Lys Pre Trr^ Leu Leu Asn Ala lr,r Vdi l-iu 

770 775 * 78C 

Glu Asn lie Thr Phe Glu Ser Pro Phe Asn Lvs Gin Arq T'.'r Lvs Me:: 

785 790 795 ^ ^ 300 

Vai lie Giu Aia Cys Scr Leu Gin Pro Asp lie Asp lie Leu Pro His 

805 810 815 

Giy Asp Gin Thr Gin lie Giy Glu Aro Giy lie Asn Leu Ser Giy Giy 

820 S25 830 

Gin Arg Pro Giy lie Ser Vai Ax a Arq Aia Leu Tyr Gin H:s Thr Asn 

835 840 845 

Vai Vai Pne Leu Asp Asp L'ro Phe Ser Aia Leu Asp Vai His Leu Ser 

850 855 860 

Asp His Leu T-le^ Gin Aia Giy 11c- Leu Ciiu Leu :.eu Arq /'.so Asd Lys 

865 870 875 ' 880 

Arq Thr Vai Vai. Leu Vai Tnr Hr;:> l.ys Leu Gin T'/r i^eu rro His Aia 

885 890 ' 895 

AsD Trp lie lie A.^a Met Lvs Aso Gl',' Thr olr; Arq G . Glv Tnr 

90C 905 " 910 

Leu Lvs A.s o P n o Gin Arc S e r G 1 C \* s Gin Le 1' n e G 1 u r. , j T r e w v s 

915 920 925 

Thr Leu Met Asn Arq Gin Asp Gin Giu Leu Glu vs Glu Tnr Va 1 Me t 

930 935 940 

Giu Arg Lvs Ala Pro Glu Pro Scr Gin Giy Leu Pro Arq A^^ Met Ser 

945 950 955 ' 960 

Ser Arg Asp Giy Leu Leu Leu Asp Glu Asp Giu Glu Glu Glu Glu Aia 

965 ' 970 975 

Ala Giu Ser Glu Giu Asp Asp Asn Leu Ser Ser Vai Leu His Gin Arg 

98 0 " 98 5 9 90 

Ala Lys lie Pro Trp Arq Ala Cys Thr Lys Tyr Leu Ser Scr Ala Giy 

995 ' 1000 ' 1005 

lie Leu Leu Leu Ser Leu Leu Vc^Il P:;e Ser Gin Leu Leu i.'.-:: His Met 

1010 1015 1020 

Vai Leu Vai A.la lie Asd 1 yr Trp Leu Ala l/vs Tro Tnr A~o Ser Aia 

025 1030 ' ' 1035 " " 1040 

Leu Vai Leu Ser Pre Aia Ala Aro Asn Gvo Ser Leu Ser Gin Giu Cvs 

1045 ' 1050 1055' 

A,..a Leu Asd Gin Ser Vai T'/r Ala Met Vai Phe "Inr "/al Leu Cys Ser 

1060 1065 107C 
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Leu Glv Tie Lej C\-e Leu Vai Thr Ser Viii Thr V^i G j Trp Thr 

' iC75 iOSC 1065 

Glv Leu Lys '/al Ala Lys hrg Leu Hi3 Arg Ser Leu Lt^u Asn Arg Zle 

' 1090 ' ^ 109b 1100 

lie Leu A-la "r:. Nlc^t Ara Pne Phe Gl'^ Thr Tr.r Pre Leu Gly Ser lie 
105 1110 1115 1120 

Leu Asn Arg Pr.^ Ser S-^r Asp G',*? A^r, Thr lie Asp Gin His He Pro 

1125 ' ^ 1130 1135 

Ser Thr Leu Glu Cys Leu Scr Arg Ser Thr Leu Leu Cys Val Ser Ala 

1140 1145 1150 

Leu Aid Val He Ser Gyr Val Tnr PrD Val ?ne Leu Val Ala Leu Leu 

1155 ]160 1165 

Pro Leu Ax a Val Val C:s Tyr Pne He Gin Lys Tvr Phe Arc: V<al Ala 

117 0 117:; 1180 

Ser Arq Asp Leu Gin Gin Leu Asd Asp Thr Thr G.n Leu Pro Leu Leu 
185 ' " 1190 " 1195 1200 

Ser H- r s Phe A^ a G 1 u T r. r Val G i u G 1 v Le u T h r Thr lie Arc: Ala Phe 

1205 '1210 "l215 

Ara Tvr Giu A- a Arg Pne C-r: Gin Lvs Leu Leu G1.U Tvr Tr.r Asp Ser 

1220 1225 ' 1230 

Asn Asn He A»-a Ger ^- Pr:e L.eu 'Ihr Ala .Ala A.sn /*ra Trr; Leu 51 u 

1235 1240 1245 

Val Arg Met: G^u Tvr ^-Tr Gly Aia Gvs Val I'al Leu He Ala Ala Ala 

1250 1255 " 1260 

Thr Ser He Ser Asn Ser Leu tiis Arq Glu Leu Ser Ala Glv Leu Val 
265 1270 1275 1280 

Glv Leu Glv Leu Thr Tvr Ala Leu Met Val Ser Asn Tvr Leu Asn Trp 

1285 1290 ^ 1295 

Met Val Ara Asn Leu Alca Asp Met Glu He Gin Leu Gly Ala Val Lys 

'l30C 1305 1310 

Gly He His Tr.r Leu Leu Lvs Thr Glu Ala Glu Ser Tvr Glu Gly Leu 

1315 ^ 1320 1325 

Leu Ala Pro Ser Leu He Pro Lys A.sn Trp Pro Asp Gin Gly Lys He 

1330 1335 1340 

Gin lie Gin Asn Leu Ser V.m:. Ara T-,t Asq Ser Ser Leu L'/s Pro Vd-i. 
34 5 1 350 ' ^ ' 1355 ' 1360 

Leu Lys His Va^ Asn A* a Leu He Ser Pro Gly Gin Lys He Gly He 

1365 1370 ^ ' 1375 

Cys Gly Ara Tnr Gly Scr Gly i.s Ser S-^r Pne Ser A^^i Pne Phe 

1380 ' ' ' 1365 1390 

Ar a Me t \'a 1 As c Me t F ne G 1 u G J y A.r g He He He Aso G - y Ho Asp 

1395 ' 1400 1405 

He Ala L','s Leu Pro Leu His Thr Leu A.rg Ser Ara Leu Ser He He 

1410 1415 1420 

Leu Gin Asd Pro Va 1 Leu P n e Ser Glv Thr 11 Am Pne A.s n Lou Asp 
425 ' 1430 ' 1435 ' 1440 

Pro Giu Lvs Lv3 Gys Ser Asd Scr Thr Leu Tro Glu Ala Leu Giu He 

14 4 5 14 50 ] 4 55 

A J. a Glr^i Leu Lys Leu Val Vdi Lys Ala Leu Pro Glv Glv Leu Aso A.la 

1460 1465 ' 'l470 

He He Thr G ^ u Glv Glv Glu As n Phe Ser Gin G i y G 1 n Ara Gin Leu 

1475 ' ' 14 80 "l485 

Phe Cys Leu Ala Arg A:.a Phe Val Ar::; Lvs Thr Ser He Pr.e He Met 

1490 1495 1500 

Aso Giu Ala Tnr Ala Ser He Asc Met Ala Thr G^u Asn He Leu Gin 
505 1510 1515 1520 

Lys Val Val Met Thr A.^ Phe Ala Aso Arci Thr Val V^i Thr Ire Ala 

1525 1530 1535 

His Ara V a i H _ s Thr He Leu Ser Ala Asp Leu Val Met Val Leu Lys 
1540 1545 ^ 1550 
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Ar:; Giy Ala Leu Giu Pne Asp Lvg Tro Giu Gys Leu Leu Ser 31 n 

1555 1560 ' 1565 

L'.'s AsD Ser '.-I ?he Ala Ser Phe Veil r-.r :: Ala Aso Lys 
1570 157 5 1580 

(2) i::rORMATION for SEQ id N0:5: 

(i) SEQ'CZllCE CHARACTERISTICS; 

(A) LE::gTH: 1582 amino acids 

(B) TV?!: amino acid 

(C) ST?ANDEDNESS: smqle 
;D) TOPOLOGY: linear 

(11) M0L£3ULE TYPE: peptide 

(vii) iMvrQXATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 784874 

;>:i; 3EQLENCE DESCRI PT I0^: : SZQ ID NO : 5 : 



Me 


Pro 


Leu 






Phe 

5 


C y s 


Glv 




G 1 u 


A.r; n 






A 1 a 




1' v r 
15 


A r q 


v' ^ ' 


Asp 


Gin 






Val 


Leu 


A.sn 


Asn 


G 1 y 






V.i ^ 


Asp 


Ai a 


Leu 


As n 




















2 5 










30 






Val 


Val 


Pro 


h 


c 


Val 


Phe 


Leu 


Leu 


Phe 


He 


Thr 


Phe 


Pro 


He 


Leu 


Pne 






T 5 












40 










45 








lie 


Glv 


T rp 


Gl 




Ser 


Gin 


Ser 


Ser 


Lys 


V a 1 


His 


lie 


His 


His 


Ser 


Tnr 




50 












55 










60 












Leu 


His 


?h 




Pro 


Gly 


His 


Asn 


Leu 


Arg 


Trp 


lie 


Leu 


Thr 


Phe 


I J- e 


65' 












70 










7 5 










80 


Leu 


Leu 


?ne 


Va 




Leu 


Val 


Cys 


Glu 


He 


Aia 


Glu 


GL y 


He 


Leu 


Ser 


Asp 












85 










90 










95 




Ql y 


Val 


Thr 


G^ 




Ser 


Arq 


Kis 


Leu 


His 


Leu 


Tyr 


Met 


Pro 


Ala 


Gly 


Met 








10 












105 










110 






Ala 


Phe 




A_ 


a 


Aia 


He 


Thr 


Ser 


V^l 






Tyr 


!lis 


Asn 


He 


Glu 






115 












120 










125 








Tr.r 


Ser 


Asn 






Pro 


Lys 


Leu 


Leu 


lie 


Ai a 


Leu 


Leu 


He 




Trp 






130 












135 










140 










Leu 


All a 


? p. 8 






Thr 


Lvs 


q< K r 


He 


L\'3 


L: e 


Val 


l.ys 


Phe 




ASD 


H i s 


145 












150 










155 










160 


.^^^ a 


lie 


_ y 






Ser 


Gin 


Leu 


Arg 


Ph^ 


v' s 


Leu 




Gly 


Leu 


Leu 














165 










170 










175 






Leu 


7yr 






Met 


Leu 


Leu 


Leu 


Val 


Glu 


V a 1 


Asn 


Vai 


He 


Arq 


Val 








18 












185 










190 






A r a 


Ar g 








Phe 


Phe 


L y s 


Thr 


Pro 


Arq 


Glu 


Va 1 


Lys 


Pro 


Pro 


Giu 






195 












200 










205 








Asp 


Leu 


Cln 


As 




Leu 


Gly 


Val 


Arq 


Phe 


Leu 


Gin 


Pro 


Phe 


Val 


Asn 


Leu 




210 












215 










220 












Ser 


Lys 






Thr 


Tyr 


Trp 


Trp 


Met 


As n 


Ala 


Phe 


He 


Lys 


Thr 


Ai a 


^ ^ ^ 

£~ ^ ^ 












230 










235 










240 


His 


Lvs 


Lvs 


Pr 




He 


Asp 


Leu 


Arq 


Ala 


He 


Ala 


Lys 


Le u 


Pro 


He 


Aia 












245 










250 










255 






Arg 


.-.a 






Thr 


Asn 


Tyr 


Gin 










Ala 


Phe 


Asp 


Aia 








26 












2 65 










27 0 








Ala 


t. r g 






Asp 


Thr 


Gin 


Ser 


Pro 


G 1 r. 


G 1 y 


Aia 


Arq 


Ala 


Ho 


Trp 


















280 










285 










Ala 


Leu 






His 


Ala 


Phe 


Glv 






Leu 


He 


Lfju 




Ser 






29C 












295 










3 00 












Ar7 


I le 






Aia 


Asp 


Leu 


Leu 


G L ■, 








Pro 




C ';z 


He 



310 H: 320 
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Fhe Gly lie Val Asp His leu Gly Lys Glu Asn iiis Val ?he Gin Pro 

325 330 335 

Lys Tnr Gin Phe Leu Giv Vai Tvr Phe Val Ger Ser Gin Gi'^ Phe Leu 

340 ' ^3^5 350 

Giy Asn Ala Tyr Val Leu Aia Vai Leu Leu Pne Leu Ala L~_i Leu Leu 

355 360 365 

Gin Arg Thr Pne Leu Gin Aia Ser Lvr Tvr i"al Ala lie Glu Thr Gly 

370 375 380 

lie Asn Leu Arq Gly Aia Tie Gin Thr Lys He Tvr Asn Lvs lie Met 
365 390 395 ' ' 400 

His Met Ser Thr Ser Asn Leu Ser Met Giy Glu Met Thr Ala Gly Gin 

405 410 415 

He Cys Asn Leu Vai Ala He Asp Thr Asn Gin Leu Met Trc Pne Pne 

420 ^ 425 430 

Pne Leu Gys Pro Asn Leu Trp Thr Met Pro Vai Gin lie lie Vai Giv 

435 440 445 

Val lie Leu Leu Tyr T-.-r He Leu Giv Vai Ser Aia Leu He Gly Aia 

450 455 460 

Aia Val He lie Leu Leu Aia Pro Val Gin T\'r Phe Vai A^a Thr Lvs 
465 470 475 430 

Lej Ser :;in A^<-: Gin Arcj Thr Thr Leu Glu His Ser Asn Glu Arg Leu 

485 490 495 

L y s Gin ~. n r Asn G i u Met Leu A r q Giv Met L \' s Leu Leu Lvs Leu : \- r 

500 " 505 ' 510 

k'^a Trp Glu Ser He Phe Cvs Sor Arg Vai Glu Val Thr Ara Arg Lvs 

515 520 525 

Giu Met Tnr Ser Leu Arg Aia Phe Ala Val Tvr Thr Ser He Ser He 

530 535 " 540 

Phe Met Asn Tnr Aia He Pro He Aia Aia Vai Leu lie Thr Phe Val 
545 550 555 560 

Giy His Val Ser Phe Phe Lvs Glu Ser Asd Leu Ser Pro Ser Vai Ala 

565 ' 570 575 

Phe Ala Ser Leu Ser Leu Phe His He Leu Vai Thr Fro Leu Phe Leu 

580 585 590 

Leu Ser Ser V^l Vai Ara Ser Thr Vai Lys Aia Leu Vai Ser Val Gin 

595 " 600 605 

Lys Leu Ser G 1 ^ Phe Leu Ser Ser Ala Giu He Arg Giu G _ u Gin Gys 

610 615 620 

Ala Pro Arq G^^ Pro Aia Pro Gin Giy Gin Aia Giy Lys I'yr Gin Ala 
625 630 635 64 0 

Val Pro Leu Lys Val Val Asn Arg Lvs Arg Pro Ala Ara Glu Giu '/al 

645 " ' 650 655 

Arg Asp Leu Leu Giy Pro Leu Gin Arg Leu Aia Pro Ser Met Asp Giy 

660 665 670 

Asp Aia Asp Asn Phe Gys Val Gin lie lie G:Ly Giy Phe Pne Thr Trp 

675 680 685 

Thr Pro Asp Gly lie Pro Thr Leu Ser Asn He Thr lie Arg He Pro 

69C 695 700 

Arg Giy Gin Leu Thr Met He Val Gly Gin Vai Gly Cys Giy Lys Ser 
70S 710 715 ' ^ 720 

Ser Leu Leu Leu Aia Thr Leu Giy Glu Met Gin Lys Vai Ser Giv Aia 

725 ^ 730 735 

Vai Phe Trp Asn Ser Asn Leu Pro Aso Ser Giu Giy Arq Giv Pro Gin 

740 745 750 

Gin Pro Arg Ale Giy Asp Ser Ser Trp Leu Gly Tvr Gin Giu Gin Arg 

755 " 760 " ^ ' 765 

Pr- Arg G^y Tyr Aia Ser Gin Lys Fro Tro Leu Leu Asn A_ a Thr Vai 

770 ' 775 ^ " 780 

Glu Giu Asn He Thr Pne Giu Ser Pre Phe Asn Pro Gin Arg Tyr Lvs 
765 790 795 s6o 



BNSDOCID: <WO 981 4571 A1J_> 



wo 98/14571 



Met: 


V3x 


lie 


Glu 


Ala 


Cys 


Ser 


Leu 


oln 


Pro 


Asp 


He 


Asp 


He 


Leu 


Pro 










805 










BIO 










815 




His 


G.y 




u ^ ri 


Tnr 


Cl:\ 


He 


Gi y 


Glu 


Arg 


Gly 


He 


Asn 


Leu 


Ser 


~' - y 








82C 










825 










830 






Gly 


Gin 


Arg 


i-r c 


Asp 


Gla 


Cys 


Gly 


Pre 


Siu 


Pro 


Ser 


Thr 


Ser 


Arg 


Pro 




335 










840 










845 








Met 


Phe 


Val 


Pr.e 


Leu 


Asp 


ASD 


Pro 


Phe 


Ser 


Ala 


Le u 


Asp 


Val 


His 


Leu 




350 










8 55 










860 










Ser 


Asp 


His 


Leu 


Met 


Gin 


Ala 


Gly 


He 


Leu 


Glu 


Leu 


Leu 


Arg 


Asp 


ASD 


865 










870 










873 










880 


Lys 


Arg 




Val 


Val 


Leu 


Val 


Thr 


His 


Lvs 


Leu 


Gin 


Tyr 


Leu 


Pro 


His 








885 










890 










895 




Ala 


Asp 


Trp 


1 1 o 


He 


Ala 


Met 


Lys 


Asp 




Thr 


He 


Gin 


Arq 


Glu 


Gly 








900 










905 










91C 






Thr 


Ls u 


L 'r S 


Aso 




Gin 


Arq 


Ser 


Glu 


C vs 


G H 


Leu 


Phe 


Glu 


His 


T rp 






915 










920 










92 5 








Lys 


Tnr 


^e u 


Met 


As n 


Arg 


Gin 


Asp 


G 1 r". 


Giu 


Le u 


G 1 u 


Lys 


Glu 


Thr 


"/al 




930 










935 










94 0 










Met 


Glu 


A r ^ 


Lvs 


Ala 


Ser 


G 1 u 


Pro 


Ser 


Gin 


Gly 


Leu 


P r o 


Ara 


Ala 


Met 


945 










950 










955 










960 


Ser 


Ser 


• • - ^ 




G 1 y 


Leu 


Leu 


..0 u 


s r ■ 


G 1 u 


G 1 .J 


1 u 


G H 


G 1 u 


Glu 


Gl J 










965 










570 










975 




hi a 


Ala 


Glu 


Ser 


Glu 


G^u 


Asp 


Asp 


A s :\ 


Leu 


So r 


S c r 


Va 1 


Leu 


His 


G-.n 








980 










985 










990 






Arg 


Ala 


uys 


He 


Pro 


Trp 


Arg 


Ala 


G vs 


Thr 


Lys 


T y r 


Leu 


Ser 


Ser 


Ala 






995 








1000 










1005 








Gly 


He 


L-u 


Leu 


Leu 


Ser 


Leu 


Leu 


val 


Phe 


Ser 


S L n 


Leu 


Leu 


Lys 


Hi.£ 


1010 








1015 










1020 










Met 


Val 


Leu 


Val 


Ala 


He 


Asp 


Tyr 


Trp 


Leu 


Ala 


Lys 


Trp 


Thr 


Asp 


Ser 


025 








1030 








1035 










1040 


Ala 


Leu 


Val 


o u 


Ser 


Pro 


Ala 


Ala 


Arq 


Asn 


Cys 


Se r 


Leu 


Ser 


Gin 


Glu 










1045 










1050 








1055 




Cys 




Lc-u 


Asp 


Gin 


Scr 


Val 


Tyr 


Ala 


Met 


Va: 


Phe 


Thr 


Leu 


Leu 


Cys 




1060 










1065 










1070 






Ser 


Leu 


Gly 


lie 


Val 


Leu 


Cys 




Val 




Ser 


V.H 


T n r 


Val 


Glu 


Trp 




1075 










1080 










1055 








Thr 


Gly 


Leu 


Ly = 


Val 


Ala 


Lys 


Arg 


Leu 


H r -3 


Ar J 


Se r 


Leu 


Leu 


Asn 


Arq 




1090 










1095 










HOC 










lie- 


lie 


Leu 


Al - 


Pro 


Met 


Arg 


Phe 


Pne 


G u 






1 ■ I ''j 


Leu 


Gly 


e r 


ICS 










1110 










111: 










1120 


He 




As n 


Arg 


Phe 


Ser 


Ser 


Asp 


C V s 


Asn 


Thr 


I le 


Asr 


G r; 


His 


He 










1125 










1130 










1135 




Pro 


Ser 


Tnr 


Leu 


Glu 


Cys 


Leu 


Ser 


Ara 


Ser 




Leu 


Leu 


Gys 


Val 


Ser 








114 0 










1145 










H5C 






Ala 


Leu 


Tr.r 


Vd 1 


He 


Ser 


Tyr 


Val 


Th.r 


Pro 


Val 


Phe 


Leu 


V.:l 


Ala 


Leu 






1155 










1160 










1165 








Leu 


Pro 


Leu 


Ala 


Val 


Val 


Cys 


Tyr 


Phe 


He 


Gin 


Lys 


Tyr 


Pne 


Arg 


Val 




117C 










1175 










118 0 










Aid 


Ser 


Arg 


ksc 


Leu 


Gin 


Gin 


Le u 


Asc 


Asp 


Thr 


Thr 


G 1 r. 


ueu 


Pro 


Leu 


185 










1190 










1195 










1200 


Val 


Ser 


Hrs 


Phe 


Ala 


Glu 


Thr 


Val 


Glu 


Gly 


Leu 


Thr 


Thr 


He 


Ara 


Ala 










1205 










1210 










1215 




Fhe 


Arq: 


-yr 


Glu 


Ala 


Arg 


?:.e 


■G - :i 


Gin 


Lys 


Leu 


Leu 


Glu 


i'/r 


■Ihr 


Asp 








i22C 










1225 










123C 






Ser 


ksn 


Asn 


He 


Ala 


Ser 


Leu 


Pne 


ueu 


Tnr 


Al^i 


Ala 


Asn 


Ar- 


T rp 


Leu 






1235 










1240 










124 5 








Glu 


Va^ 


Cys 


Met 


Glu 


Tyr 


He 


ciy 


Ala 


Cys 


V a 1 


Val 


Leu 


: le 


Ala 


Ala 




1250 










1255 










1260 










Ala 




i- c r 


He 


Ser 


Asn 


Ser 


Leu 


His 


Ara 


G 1 u 


Lo u 


Sor 


Ala 


Gly 


l^eu 



255 1270 12^5 1280 

H 
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V^l Glv leu Gly Le-J Thr Tyr Ala Leu Met V^l Ser Asn T'/r Leu Asn 

1283 1290 ' 1295 

T r n Met '.' a 1 Ar Asn Leu A - a hso Me r. G 1 u 1 1 tt Gin Le u G I y A i a '/a I 

130C ' 1305 131C 

Lys Arg l^e His Ala Leu Le\: Lv:^ Thr Giu Ai^ Glu Ser Tvt Giu Gly 

1315 1320 1325 

Leu Leu Ala Pro Ser Lc-u lie Pre Lys Asn Trn Pro Asd Gl.r. Gly Lys 

1330 1335 " 1340 

He Glr. lie Gin Asn Leu Ser Val Arg Tyr Asc Ser Ser Le« Lys Pro 
345 1350 ' 1355 1360 

Val Leu l.ys Hi 5 Val Asn T^.r Leu He Ser Pro Gly Gin Lve lie Glv 

1365 1370 ' 1375 

I-e Cys Giy Ara Thr Glv Ser Glv Lvs Ser Ser ?he Ser Leu Ala Phe 

138C 1385 139C 

Pr.e Arg Xet Val Asp Met Pne GH Glv Arq He He He Asc Gly He 

1395 1400 1405 

Asp He Al.a Lys Leu Pro Leu Wis. Tnr Leu Ara Ser Arg Leu Ser He 

1410 1^15 1420 

He Leu Gin Asc Pro Vul Lcj Pr.e Ser Gly T^.r He Ara Pr.e Asn Leu 
425 1430 1435 ' 1440 

Asp Pro Hu Lys Lys Cvs Ser /■.sc Ser Thr Leu Tro Giu Ali Leu Giu 

:44S 1450 * 1455 

Ij.e Ala S^n L.eu Lvs Lou Vu . V:^; .,vs A^a Leu Pro Glv Gl'.' Leu .^so 

14 6C 14 65 ^147: 

A^a He -ie Thr GIj Gly Gly Giu Asn Phe Ser Gin CJ y Glr. Ara Gin 

14 75 14 80 14 85 

Leu Phe Cys Leu Ala Ara Ala Phe Val Arg Lvs Thr Ser He Phe He 

1490 1495 ' 1500 

Hez Asp Giu Ala Thr Ala Ser He Asp> Met Ala Thr Giu Asn He Leu 
505 1510 * 1515 1520 

Gin Lys Val Val Met Thr Ala Phe Ala Aso Arg Thr Val Val Thr He 

1525 1530 1535 

Ala His Arg Val His Thr H^ Leu Ser Ala Aso Leu Val Met Val Leu 

1540 1545 ' 153C 

Lys Arc Sly Ala He Leu GG Phe Aso L^'S Pro Giu Thr Leu Leu Ser 

1555 1560 ' ' 1565 

Gin Lvs r-.SD Ser Val Phe .Axcj Ser Phe Val Ara Ala A.so Lvs 
1570 ISlb ' 1580 
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CLAIMS 

1 . A substantially purified polypeptide comprising the amino acid sequence of SEQ ID 
NO: 1 . or fragments thereof. 

2. An isolated and purified polynucleotide sequence encoding the polypeptide of claim 1 . 
5 3. An isolated polynucleotide sequence comprising the nucleic acid sequence of SEQ ID 

NO; 2 or variants thereof. 

4. The polynucleotide sequence of claim 3 wherein said polynucleotide comprises the 
portion of SEQ ID N0:2 extending from nucleotide T2no nucleotide A2923. 

5. An isolated polynucleotide sequence which is complementary to SEQ ID N0:2 or a 
10 variants thereof 

6. An isolated polynucleotide sequence which hybridizes under stringent conditions to 
SEQ ID N0:2. 

7. A hybridization probe comprising SEQ ID NO;2, or fragments thereof 

8. A hybridization probe comprising the polynucleotide sequence of claim 4. 

15 9. A recombinant expression vector containing the polynucleotide sequence of claim 3. 

10. A recombinant host cell containing the expression vector of 

claim 9. 

11 . A method for producing the polypeptide of SEQ ID NO: 1 . or fragments thereof the 
method comprising the steps of 

20 a) culturing the host cell of claim 10 under conditions suitable for the expression 

of the polypeptide: and 

b) recovermg the polypeptide from the host cell culture. 

12. A pharmaceutical composition comprising the polypeptide of SEQ ID NO:i in 
conjunction with a pharmaceutical ly acceptable excipient. 

25 1 3. A purified antibody which binds specifically to the polypeptide of claim 1 . 

14. A purified agonist which specifically modulates the activity of the polypeptide of 

claim 1. 

15. A purified antagonist which specifically modulates the activity of the polypeptide of 

claim 1. 

30 16. A method for the detection of polynucleotides encoding sulfonylurea receptor protein 

in a biological sample comprising the steps of 

a) hybridizing the probe of claim 7 to nucleic acid material, thereby forming a 

-58- 
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hybridization complex, and 

b) detecting said hybridization complex, wherein the presence of said complex 
correlates with the presence of a polynucleotide encoding sulfonylurea receptor protein in said 
biological sample. 

5 17. The method of claim 16 wherein before hybridization, the nucleic acid material of the 

biological sample is amplified by the polymerase chain reaction. 



-59- 
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